SDTM Version 1.4

Prepared by the CDISC Submission Data Standards Team

Notes to Readers

This is Version 1.4 of the Study Data Tabulation Model Document (SDTM). This document includes additional variables related to human clinical trials and animal studies, and other corrections and clarifications to the text. A full description of all changes from the prior version is provided in Section 7.1.

Revision History

Date	Version	Summary of Changes
2013-11-26	Version 1.4 Final	Final Version
2012-07-16	Version 1.3 Final	Final Version
2008-11-12	Version 1.2 Final	Final Version
2005-04-28	Version 1.1 Final	Final version incorporating minor corrections to address comments submitted during public review period.
2004-06-25	Version 1.0	First released version reflecting all changes identified during comment periods.

Note: Please see Appendix A for Representations and Warranties, Limitations of Liability, and Disclaimers.

1. Introduction

1.1 Purpose

This document describes the Study Data Tabulation Model (SDTM), which defines a standard structure for study data tabulations that are to be submitted as part of a product application to a regulatory authority such as the United States Food and Drug Administration (FDA). This document is based on material prepared by the Submissions Data Standards (SDS) Team of the Clinical Data Interchange Standards Consortium (CDISC). This document, which will supersede all prior versions, includes numerous changes from the prior Version 1.3, which are described in Section 7.1.

Data tabulation datasets are one of four ways to represent the human subject Case Report Tabulation (CRT) and equivalent animal data submitted to the FDA. CRTs are also submitted in the format of subject profiles, data listings, and analysis datasets. One benefit to industry of submitting data tabulation datasets that conform to the standard structure is that it minimizes the need to submit the same data in multiple formats.

The availability of standard submission data may provide many benefits to regulatory reviewers. Reviewers can now be trained in the principles of standardized datasets and the use of standard software tools, and thus be able to work with the data more effectively with less preparation time. Another benefit of the standardized datasets is that they can provide support for the FDA's efforts to develop a repository for all submitted studies and a suite of standard review tools to access, manipulate, and view the study data.

This document is intended for companies and individuals involved in the collection, preparation, and analysis of study data submitted to regulatory authorities. Guidance, specifications, and regulations for the application of this model will be provided separately by regulatory authorities. Audiences are advised to refer to these documents before preparing a regulatory submission based on the SDTM.

1.2 Relationship to Prior CDISC Models

This document is a successor of what was known in prior versions as the CDISC Submission Data Standards or Submission Domain Models. While Version 1.0 SDTM was designated as the first implementation-ready version for clinical studies involving human drug products, improvements and enhancements have been incorporated in subsequent versions to support a broader range of regulated products, including the needs of non-clinical animal toxicity studies. Efforts will continue to further evaluate the model for human and animal studies involving other regulated products including food additives; therapeutic biologics; blood derivatives; vaccines; cellular, tissue, and gene therapy; and devices. Structured evaluation pilots of the SDTM are planned for these products, and the lessons learned from these pilots will be used in developing future enhancements to the standard. Implementation guides for applying the model to each type of data and guidance on controlled terminology will be published separately.

1.3 Significant Changes from Prior Versions

The SDTM has been designed for backward compatibility; datasets prepared with V1.4 should be fully compatible with prior versions. In most cases, this means that later versions may add new variables or correct textual errors, but do not eliminate variables or structures incorporated in prior versions. There are, however, isolated instances where more recently added SDTM variables are recognized as being more functional than older ones. To help ensure more consistency in implementations, the less functional variables are deprecated. Section 7.2 lists planned variable deprecations to take effect in a future SDTM release. In addition to including several text corrections and clarifications, V1.4 has been expanded to include 1) new tables to relay data about persons who are not the "subjects" of a study (referred to as Associated Persons), 2) an additional Trial Design table to describe the disease assessment schedule in some oncology trials, and 3) new variables in the general observation class tables.

The following new sections have been added:

•Section 3.5 - The Trial Disease Assessments

•Section 5 - Applying Model Fundamentals to Associated Persons

The following sections have been re-numbered within SDTM V1.4:

•Section 6 "Using the Model for Regulatory Submissions" - was previously "Section 5" in SDTM V1.3

•Section 7 "SDTM Version History" - was previously "Section 6" in SDTM V1.3

The following new tables have been included in Version 1.4:

•Table 3.5.1 - Trial Disease Assessments

•Table 5.1.1 - Associated Persons Data – Identifier Variables

•Table 5.2.1 - APRELSUB Table

New variables have been added to the following tables:

•Table 2.2.1 - Interventions

•Table 2.2.2 - Events

•Table 2.2.3 - Findings

•Table 2.2.4 - Identifiers

•Table 2.2.5 - Timing Variables

2. Model Fundamentals

2.1 Model Concepts and Terms

The SDTM provides a general framework for describing the organization of information collected during human and animal studies and submitted to regulatory authorities. The model is built around the concept of observations, which consist of discrete pieces of information collected during a study. Observations normally correspond to rows in a dataset. A collection of observations on a particular topic is considered a domain. For example, "Subject 101 had mild nausea starting on Study Day 6" is an observation belonging to the Adverse Events domain in a clinical trial.

Each observation can be described by a series of named variables. Each variable, which normally corresponds to a column in a dataset, can be classified according to its Role. A Role describes the type of information conveyed by the variable about each distinct observation and how it can be used. SDTM variables can be classified into five major roles:

•Identifier variables, such as those that identify the study, the subject (individual human or animal or group of individuals) involved in the study, the domain, and the sequence number of the record.

•Topic variables, which specify the focus of the observation (such as the name of a lab test).

•Timing variables, which describe the timing of an observation (such as start date and end date).

•Qualifier variables, which include additional illustrative text, or numeric values that describe the results or additional traits of the observation (such as units or descriptive adjectives).

•Rule variables, which express an algorithm or executable method to define start, end, or looping conditions in the Trial Design model.

The set of Qualifier variables can be further categorized into five sub-classes:

•Grouping Qualifiers are used to group together a collection of observations within the same domain. Examples include –CAT and –SCAT.

•Result Qualifiers describe the specific results associated with the topic variable in a Findings dataset. They answer the question raised by the topic variable. Result Qualifiers are –ORRES, –STRESC, and –STRESN.

•Synonym Qualifiers specify an alternative name for a particular variable in an observation. Examples include –MODIFY and –DECOD, which are equivalent terms for a –TRT or –TERM Topic variable, and –TEST for –TESTCD.

•Record Qualifiers define additional attributes of the observation record as a whole (rather than describing a particular variable within a record). Examples include –REASND, AESLIFE, and all other SAE flag variables in the AE domain; AGE, SEX, and RACE in the DM domain; and –BLFL, –POS, –LOC, –SPEC, and –NAM in a Findings domain.

•Variable Qualifiers are used to further modify or describe a specific variable within an observation and are only meaningful in the context of the variable they qualify. Examples include –ORRESU, –ORNRHI, and –ORNRLO, all of which are Variable Qualifiers of –ORRES; and –DOSU, which is a Variable Qualifier of –DOSE.

For example, in the observation, "Subject 101 had mild nausea starting on Study Day 6", the Topic variable value is the term for the adverse event, "NAUSEA". The Identifier variable is the subject identifier, "101". The Timing variable is the study day of the start of the event, which captures the information, "starting on Study Day 6", while an example of a Record Qualifier is the severity, the value for which is "MILD". Additional Timing and Qualifier variables could be included to provide the necessary detail to adequately describe an observation.
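
The role assignments in this example can be sketched in a few lines of Python. This is only an illustration: the variable names (USUBJID, AETERM, AESTDY, AESEV) and the `ROLE_OF` lookup are assumptions made for the sketch, chosen to match the identifier, topic, timing, and severity values named above.

```python
from collections import defaultdict

# Hypothetical role lookup covering only the variables used in this example.
ROLE_OF = {
    "USUBJID": "Identifier",
    "AETERM": "Topic",
    "AESTDY": "Timing",
    "AESEV": "Record Qualifier",
}

# "Subject 101 had mild nausea starting on Study Day 6"
observation = {"USUBJID": "101", "AETERM": "NAUSEA", "AESTDY": 6, "AESEV": "MILD"}


def group_by_role(record, role_of):
    """Return {role: {variable: value}} for a single observation record."""
    grouped = defaultdict(dict)
    for name, value in record.items():
        grouped[role_of[name]][name] = value
    return dict(grouped)


by_role = group_by_role(observation, ROLE_OF)
```

Each variable lands under exactly one role, which mirrors how the model treats a row: one Topic value, with Identifiers, Timing, and Qualifiers describing it.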

Most of the data collected in a study is about the subjects who are enrolled in the study. Sometimes, however, data is collected about other persons (Associated Persons, APs) who can be associated with the study, a particular study subject, or a device used in the study. Associated Persons may or may not have a familial relationship to a study subject.

Observations about study subjects are normally collected for all subjects in a series of domains. A domain is defined as a collection of logically related observations with a common topic. The logic of the relationship may pertain to the scientific subject matter of the data or to its role in the trial. Each domain dataset is distinguished by a unique, two-character code that should be used consistently throughout the submission. This code, which is stored in the SDTM variable named DOMAIN, is used in four ways: as the dataset name, as the value of the DOMAIN variable in that dataset, as a prefix for most variable names in that dataset, and as a value in the RDOMAIN variable in relationship tables.

All datasets are structured as flat files with rows representing observations and columns representing variables. Each dataset is described by metadata definitions that provide information about the variables used in the dataset. The metadata are described in a data definition document named "define" that is submitted with the data to regulatory authorities. (See the Case Report Tabulation Data Definition Specification [Define-XML], available at www.CDISC.org). Define-XML specifies seven distinct metadata attributes to describe SDTM data:

•The Variable Name (limited to 8 characters for compatibility with the SAS Transport format)

•A descriptive Variable Label, using up to 40 characters, which should be unique for each variable in the dataset

•The data Type (e.g., whether the variable value is a character or numeric)

•The set of controlled terminology for the value or the presentation format of the variable (Controlled Terms or Format)

•The Origin of each variable

•The Role of the variable, which determines how the variable is used in the dataset. Roles include Identifiers, Topic, Timing, and the five types of Qualifiers.

•Comments or other relevant information about the variable or its data included by the sponsor as necessary to communicate information about the variable or its contents to a regulatory agency.
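
As an informal illustration, the seven attributes listed above could be held in a small Python record that also checks the two length limits. The class name, field names, and the example values (including the origin "CRF") are assumptions made for this sketch and are not defined by Define-XML or the SDTM.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VariableMetadata:
    """Hypothetical container for the seven attributes listed above."""
    name: str
    label: str
    data_type: str                      # "Char" or "Num"
    controlled_terms_or_format: str = ""
    origin: str = ""
    role: str = ""
    comments: str = ""

    def problems(self):
        """Return a list of violations of the limits stated in the text."""
        issues = []
        if len(self.name) > 8:
            issues.append("name exceeds 8 characters")
        if len(self.label) > 40:
            issues.append("label exceeds 40 characters")
        if self.data_type not in ("Char", "Num"):
            issues.append("type must be Char or Num")
        return issues


# Illustrative values only.
ok = VariableMetadata("AESEV", "Severity/Intensity", "Char",
                      origin="CRF", role="Record Qualifier")
bad = VariableMetadata("AESERIOUSX", "x" * 41, "Date")
```

Calling `ok.problems()` yields an empty list, while `bad.problems()` reports all three violations.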

Data stored in these variables include both raw (as captured by the data provider) and derived values (e.g., converted into standard units or computed, such as age). The SDTM describes the name, label, role, and type for the standard variables. The origin attribute has controlled terminology defined by CDISC, as do values for many of the SDTM variables. Note that current types are restricted to character and numeric for compatibility with SAS Version 5 transport files; it is expected that additional, more descriptive data types (e.g., integer, float, date, date/time) will be used in the future when the SAS v5 transport requirement is changed to a newer version or to a different format such as XML.

When creating submissions, a sponsor may drop certain variables (those defined as permissible in the implementation guide) from the dataset and the corresponding descriptions from the Define-XML, as long as no data was collected for these variables. New sponsor-defined variables must not be added, and existing variables must not be renamed or modified for novel usage. Sponsors should consult the appropriate implementation guide, which specifically describes which variables are required, expected, or permissible to use in specific domains based on the general observation classes.

2.2 The General Observation Classes

The majority of observations collected during a study can be divided among three general observation classes, Interventions, Events, and Findings:

•The Interventions class, described in Table 2.2.1, captures investigational, therapeutic and other treatments that are administered to the subject (with some actual or expected physiological effect) either as specified by the study protocol (e.g., "exposure"), coincident with the study assessment period (e.g., "concomitant medications"), or other substances self-administered by the subject (such as alcohol, tobacco, or caffeine).

•The Events class, described in Table 2.2.2, captures planned protocol milestones such as randomization and study completion, and occurrences, conditions, or incidents independent of planned study evaluations occurring during the trial (e.g., adverse events) or prior to the trial (e.g., medical history).

•The Findings class, described in Table 2.2.3, captures the observations resulting from planned evaluations to address specific tests or questions such as laboratory tests, ECG testing, and questions listed on questionnaires. The Findings class also includes a sub-type "Findings About" which is used to record findings related to observations in the Interventions or Events class.
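
For the Findings class, Table 2.2.3 describes how a collected result (–ORRES) is carried forward in a standardized character form (–STRESC) and, when numeric, also in a numeric form (–STRESN). A minimal Python sketch of that relationship follows; the function names and the `STANDARD_TEXT` map are assumptions based on the NONE/NEG/NEGATIVE example in that table, and unit conversion is deliberately left out.

```python
import math

# Hypothetical synonym map, modeled on the NONE/NEG/NEGATIVE example.
STANDARD_TEXT = {"NONE": "NEGATIVE", "NEG": "NEGATIVE", "NEGATIVE": "NEGATIVE"}


def to_stresc(orres: str) -> str:
    """Standardize a collected result; unit conversion is out of scope here."""
    text = orres.strip()
    return STANDARD_TEXT.get(text.upper(), text)


def to_stresn(stresc: str):
    """Return the result as a float when it is a finite number, else None."""
    try:
        value = float(stresc)
    except ValueError:
        return None
    return value if math.isfinite(value) else None


results = {
    orres: (to_stresc(orres), to_stresn(to_stresc(orres)))
    for orres in ("120", "<1", "neg")
}
```

In this sketch a value such as "<1" stays in the character result only, because it cannot be read as a number.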

Datasets based on any of the general observation classes share a set of common Identifier and Timing variables. The set of Identifier variables used is described in Table 2.2.4. The set of Timing variables that should be used for all three general observation classes is included in Table 2.2.5. As a general rule, any valid Identifier or Timing variable is permissible for use in any submission dataset based on a general observation class.

In the tables in this section, two hyphens before a variable name (shown here as a dash, e.g., –TRT) indicate that a prefix based on the two-character domain code is required. Using the domain code as a variable prefix minimizes the risk of difficulty when merging or joining domains for reporting purposes.
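
The prefix convention can be sketched as a small Python helper. The function name is hypothetical, and accepting both a literal two-hyphen marker and the single dash character used in this text is an assumption made so the sketch works with either rendering.

```python
def expand_variable_name(placeholder: str, domain: str) -> str:
    """Replace the leading prefix placeholder with the two-character domain code."""
    if len(domain) != 2:
        raise ValueError("domain code must be exactly two characters")
    # Accept "--" as well as the single dash character (U+2013) used in this text.
    for marker in ("--", "\u2013"):
        if placeholder.startswith(marker):
            return domain.upper() + placeholder[len(marker):]
    # No placeholder present (for example, a name that carries no domain prefix).
    return placeholder


ae_term = expand_variable_name("--TERM", "AE")
mi_testcd = expand_variable_name("\u2013TESTCD", "MI")
```

Here `ae_term` is "AETERM" and `mi_testcd` is "MITESTCD", so the same generic variable stays distinguishable once datasets from different domains are combined.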

In addition to the three general observation classes, a submission will generally include a set of other special-purpose datasets of specific standardized structures to represent additional important information. Examples include the following:

•A Demographics special-purpose domain is included with human and animal studies, described in Section 2.2.6.

•Other special-purpose domains, such as Comments (Section 2.2.7), Subject Elements (Section 2.2.8), and Subject Visits (Section 2.2.9).

•Datasets to describe the design of a trial, described in Section 3.

•Datasets to represent the relationships between datasets and records, described in Section 4.

The SDTM is the foundation for many implementations. Examples include the SDTM Implementation Guide for Human Clinical Trials and the SEND (Standard for the Exchange of Nonclinical Data) Implementation Guide. Not all variables described in the tables in this document (SDTM Tables) are appropriate for all implementations. Please refer to the implementation guides for specific information on any restrictions.

2.2.1 The Interventions Observation Class

Table 2.2.1: Interventions — Topic and Qualifier Variables, One Record per Constant-Dosing Interval or Intervention Episode

Variable Name	Variable Label	Type	Role	Description
Topic Variable
–TRT	Name of Treatment	Char	Topic	The topic for the intervention observation, usually the verbatim name of the treatment, drug, medicine, or therapy given during the dosing interval for the observation.
Qualifier Variables
–MODIFY	Modified Treatment Name	Char	Synonym Qualifier of –TRT	If the value for –TRT is modified for coding purposes, then the modified text is placed here.
–DECOD	Standardized Treatment Name	Char	Synonym Qualifier of –TRT	Standardized or dictionary-derived name of the topic variable, –TRT, or the modified topic variable (–MODIFY), if applicable. Equivalent to the generic drug name in WHO Drug, or a term in SNOMED, ICD9, or other published or sponsor-defined dictionaries.
–MOOD	Mood	Char	Record Qualifier	Mode or condition of the record (e.g., SCHEDULED, PERFORMED).
–CAT	Category	Char	Grouping Qualifier	Used to define a category of topic-variable values.
–SCAT	Subcategory	Char	Grouping Qualifier	Used to define a further categorization of –CAT values.
–PRESP	Pre-specified	Char	Variable Qualifier of –TRT	Used when a specific intervention is pre-specified on a CRF. Values should be "Y" or null.
–OCCUR	Occurrence	Char	Record Qualifier	Used to record whether a pre-specified intervention occurred when information about the occurrence of a specific intervention is solicited.
–STAT	Completion Status	Char	Record Qualifier	Used to indicate when a question about the occurrence of a pre-specified intervention was not answered. Should be null or have a value of NOT DONE.
–REASND	Reason Not Done	Char	Record Qualifier	Reason not done. Used in conjunction with –STAT when value is NOT DONE.
–INDC	Indication	Char	Record Qualifier	Denotes the indication for the intervention (e.g., why the therapy was taken or administered).
–CLAS	Class	Char	Variable Qualifier of –TRT	Class for a medication or treatment, often obtained from a coding dictionary.
–CLASCD	Class Code	Char	Variable Qualifier of –TRT	Used to represent code for –CLAS.
–DOSE	Dose	Num	Record Qualifier	Amount of –TRT given. Not populated when –DOSTXT is populated.
–DOSTXT	Dose Description	Char	Record Qualifier	Dosing information collected in text form. Examples: <1 per day, 200-400. Not populated when –DOSE is populated.
–DOSU	Dose Units	Char	Variable Qualifier of –DOSE, –DOSTXT or –DOSTOT	Units for –DOSE, –DOSTOT, or –DOSTXT (Examples: ng, mg, mg/kg).
–DOSFRM	Dose Form	Char	Variable Qualifier of –DOSE, –DOSTXT or –DOSTOT	Dose form for the treatment. Examples: TABLET, CAPSULE.
–DOSFRQ	Dosing Frequency per Interval	Char	Variable Qualifier of –DOSE, –DOSTXT or –DOSTOT	Usually expressed as the number of doses given per a specific interval. Examples: Q2H, QD, PRN.
–DOSTOT	Total Daily Dose	Num	Record Qualifier	Total daily dose of –TRT using the units in –DOSU. Used when dosing is collected as Total Daily Dose.
–DOSRGM	Intended Dose Regimen	Char	Variable Qualifier of –DOSE, –DOSTXT or –DOSTOT	Text description of the (intended) schedule or regimen for the Intervention. Example: TWO WEEKS ON, TWO WEEKS OFF.
–ROUTE	Route of Administration	Char	Variable Qualifier of –TRT	Route of administration for the intervention. Examples: ORAL, INTRAVENOUS.
–LOT	Lot Number	Char	Record Qualifier	Lot number for the intervention described in –TRT.
–LOC	Location of Dose Administration	Char	Record Qualifier	Anatomical location of an intervention, such as an injection site. Example: ARM for an injection.
–LAT	Laterality	Char	Variable Qualifier of –LOC	Qualifier for anatomical location further detailing laterality of intervention administration. Examples: RIGHT, LEFT, BILATERAL.
–DIR	Directionality	Char	Variable Qualifier of –LOC	Qualifier for anatomical location further detailing directionality of intervention administration. Examples: ANTERIOR, LOWER, PROXIMAL.
–PORTOT	Portion or Totality	Char	Variable Qualifier of –LOC	Qualifier for anatomical location further detailing the distribution (that is, the arrangement or apportioning) of the intervention administration. Examples: ENTIRE, SINGLE, SEGMENT, MANY.
–FAST	Fasting Status	Char	Record Qualifier	Indicator used to identify fasting status. Valid values include Y, N, U or null if not relevant.
–PSTRG	Pharmaceutical Strength	Num	Variable Qualifier	Amount of an active ingredient expressed quantitatively per dosage unit, per unit of volume, or per unit of weight, according to the pharmaceutical dose form. Examples: 50 mg/TABLET, 300 mg/L.
–PSTRGU	Pharmaceutical Strength Units	Char	Variable Qualifier	Unit for –PSTRG. Examples: mg/TABLET, mg/mL.
–TRTV	Treatment Vehicle	Char	Record Qualifier	Vehicle for administration of treatment, such as a liquid in which the treatment drug is dissolved. Example: SALINE.
–VAMT	Treatment Vehicle Amount	Num	Record Qualifier	Amount of the prepared product (treatment + vehicle) administered or given. Note: should not be diluent amount alone.
–VAMTU	Treatment Vehicle Amount Units	Char	Variable Qualifier of –VAMT	Units for the prepared product (treatment + vehicle). Examples: mL, mg.
–ADJ	Reason for Dose Adjustment	Char	Record Qualifier	Describes reason or explanation of why a dose is adjusted. Examples: ADVERSE EVENT, INSUFFICIENT RESPONSE, NON-MEDICAL REASON.
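
Table 2.2.1 states that –DOSE is not populated when –DOSTXT is populated, and vice versa. A minimal Python sketch of a check for that rule follows. The function names, the treatment of an empty string as "not populated", and the domain code EX used in the example records are assumptions made for illustration.

```python
def is_populated(value) -> bool:
    """Treat None and the empty string as not populated (an assumption)."""
    return value is not None and value != ""


def check_dose(record: dict, domain: str) -> list:
    """Flag records in which both the numeric and the text dose are populated."""
    issues = []
    dose = record.get(f"{domain}DOSE")
    dostxt = record.get(f"{domain}DOSTXT")
    if is_populated(dose) and is_populated(dostxt):
        issues.append(f"{domain}DOSE and {domain}DOSTXT are both populated")
    return issues


# Hypothetical records using the illustrative domain code EX.
numeric_dose = {"EXTRT": "DRUG A", "EXDOSE": 200, "EXDOSU": "mg"}
text_dose = {"EXTRT": "DRUG A", "EXDOSTXT": "200-400", "EXDOSU": "mg"}
conflicting = {"EXTRT": "DRUG A", "EXDOSE": 200, "EXDOSTXT": "200-400"}

dose_issues = [check_dose(r, "EX") for r in (numeric_dose, text_dose, conflicting)]
```

Only the third record is flagged, since it carries the dose in both forms.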

2.2.2 The Events Observation Class

Table 2.2.2: Events — Topic and Qualifier Variables, One Record per Event

Variable Name	Variable Label	Type	Role	Description
Topic Variable
–TERM	Reported Term	Char	Topic	Topic variable for an event observation, which is the verbatim or pre-specified name of the event.
Qualifier Variables
–MODIFY	Modified Reported Term	Char	Synonym Qualifier of –TERM	If the value for –TERM is modified for coding purposes, then the modified text is placed here.
–LLT	Lowest Level Term	Char	Variable Qualifier of –TERM	MedDRA Lowest Level Term.
–LLTCD	Lowest Level Term Code	Num	Variable Qualifier of –LLT	MedDRA Lowest Level Term code.
–DECOD	Dictionary-Derived Term	Char	Synonym Qualifier of –TERM	Dictionary or sponsor-defined derived text description of the topic variable, –TERM, or the modified topic variable (–MODIFY), if applicable. Equivalent to the Preferred Term (PT in MedDRA).
–PTCD	Preferred Term Code	Num	Variable Qualifier of –DECOD	MedDRA Preferred Term code.
–HLT	High Level Term	Char	Variable Qualifier of –TERM	MedDRA High Level Term from the primary path.
–HLTCD	High Level Term Code	Num	Variable Qualifier of –HLT	MedDRA High Level Term code from the primary path.
–HLGT	High Level Group Term	Char	Variable Qualifier of –TERM	MedDRA High Level Group Term from the primary path.
–HLGTCD	High Level Group Term Code	Num	Variable Qualifier of –HLGT	MedDRA High Level Group Term code from the primary path.
–CAT	Category	Char	Grouping Qualifier	Used to define a category of topic-variable values.
–SCAT	Subcategory	Char	Grouping Qualifier	Used to define a further categorization of –CAT values.
–PRESP	Pre-specified	Char	Variable Qualifier of –TERM	Used to indicate whether the event described by –TERM was pre-specified on a CRF. Value is Y for pre-specified events, null for spontaneously reported events.
–OCCUR	Occurrence	Char	Record Qualifier	Used to record whether a pre-specified event occurred when information about the occurrence of a specific event is solicited.
–STAT	Completion Status	Char	Record Qualifier	Used to indicate when a question about the occurrence of a pre-specified event was not answered. Should be null or have a value of NOT DONE.
–REASND	Reason Not Done	Char	Record Qualifier	Reason not done. Used in conjunction with –STAT when its value is NOT DONE.
–BODSYS	Body System or Organ Class	Char	Record Qualifier	Body system or system organ class assigned for analysis from a standard hierarchy (e.g., MedDRA) associated with an event. Example: GASTROINTESTINAL DISORDERS.
–BDSYCD	Body System or Organ Class Code	Num	Variable Qualifier of –BODSYS	MedDRA System Organ Class code corresponding to –BODSYS assigned for analysis.
–SOC	Primary System Organ Class	Char	Variable Qualifier of –TERM	MedDRA primary System Organ Class associated with the event.
–SOCCD	Primary System Organ Class Code	Num	Variable Qualifier of –SOC	MedDRA primary System Organ Class code.
–LOC	Location of Event	Char	Record Qualifier	Describes anatomical location relevant for the event. Example: ARM for skin rash.
–LAT	Laterality	Char	Variable Qualifier of –LOC	Qualifier for anatomical location further detailing laterality. Examples: RIGHT, LEFT, BILATERAL.
–DIR	Directionality	Char	Variable Qualifier of –LOC	Qualifier for anatomical location further detailing directionality. Examples: ANTERIOR, LOWER, PROXIMAL.
–PORTOT	Portion or Totality	Char	Variable Qualifier of –LOC	Qualifier for anatomical location further detailing the distribution (that is, the arrangement or apportioning). Examples: ENTIRE, SINGLE, SEGMENT, MANY.
–PARTY	Accountable Party	Char	Record Qualifier	Party accountable for the transferable object (e.g., device, specimen) as a result of the activity performed in the associated –TERM variable. The party could be an individual (e.g., subject), an organization (e.g., sponsor), or a location that is a proxy for an individual or organization (e.g., site). It is usually a somewhat general term that is further identified in the –PRTYID variable.
–PRTYID	Identification of Accountable Party	Char	Record Qualifier	Identification of the specific party accountable for the transferable object (e.g., device, specimen) after the action in –TERM is taken. Used in conjunction with –PARTY.
–SEV	Severity/Intensity	Char	Record Qualifier	The severity or intensity of the event. Examples: MILD, MODERATE, SEVERE.
–SER	Serious Event	Char	Record Qualifier	Is this a serious event? Valid values are "Y" and "N".
–ACN	Action Taken with Study Treatment	Char	Record Qualifier	Describes changes made to the study treatment as a result of the event. Examples: DOSE INCREASED, DOSE NOT CHANGED.
–ACNOTH	Other Action Taken	Char	Record Qualifier	Describes other actions taken as a result of the event that are unrelated to dose adjustments of study treatment.
–ACNDEV	Action Taken with Device	Char	Record Qualifier	Action taken with respect to a device in a study, which may or may not be the device under study.
–REL	Causality	Char	Record Qualifier	Records the investigator's opinion as to the causality of the event to the treatment. ICH E2A and E2B examples include NOT RELATED, UNLIKELY RELATED, POSSIBLY RELATED, RELATED.
–RELNST	Relationship to Non-Study Treatment	Char	Record Qualifier	An opinion as to whether the event may have been due to a treatment other than study drug. Example: "MORE LIKELY RELATED TO ASPIRIN USE."
–PATT	Pattern of Event	Char	Record Qualifier	Used to indicate the pattern of the event over time. Examples: INTERMITTENT, CONTINUOUS, SINGLE EVENT.
–OUT	Outcome of Event	Char	Record Qualifier	Description of the outcome of an event. Examples: RECOVERED/RESOLVED, FATAL.
–SCAN	Involves Cancer	Char	Record Qualifier	Was the event associated with the development of cancer? Valid values are "Y" and "N".
–SCONG	Congenital Anomaly or Birth Defect	Char	Record Qualifier	Was the event associated with congenital anomaly or birth defect? Valid values are "Y" and "N".
–SDISAB	Persist or Signif Disability/Incapacity	Char	Record Qualifier	Did the event result in persistent or significant disability/incapacity? Valid values are "Y" and "N".
–SDTH	Results in Death	Char	Record Qualifier	Did the event result in death? Valid values are "Y" and "N".
–SHOSP	Requires or Prolongs Hospitalization	Char	Record Qualifier	Did the event require or prolong hospitalization? Valid values are "Y" and "N".
–SLIFE	Is Life Threatening	Char	Record Qualifier	Was the event life threatening? Valid values are "Y" and "N".
–SOD	Occurred with Overdose	Char	Record Qualifier	Did the event occur with an overdose? Valid values are "Y" and "N".
–SMIE	Other Medically Important Serious Event	Char	Record Qualifier	Do additional categories for seriousness apply? Valid values are "Y" and "N".
–CONTRT	Concomitant or Additional Trtmnt Given	Char	Record Qualifier	Was another treatment given because of the occurrence of the event? Valid values are "Y" and "N".
–TOX	Toxicity	Char	Variable Qualifier of –TOXGR	Description of toxicity quantified by –TOXGR such as NCI CTCAE Short Name. Examples: HYPERCALCEMIA, HYPOCALCEMIA. Sponsor should specify which scale and version is used in the Sponsor Comments column of the Define-XML document.
–TOXGR	Toxicity Grade	Char	Record Qualifier	Records toxicity grade using a standard toxicity scale (such as the NCI CTCAE). Sponsor should specify which scale and version is used in the Sponsor Comments column of the Define-XML document.
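
Several rows in Table 2.2.2 restrict the values a variable may take: –STAT should be null or NOT DONE, –REASND is used together with –STAT when its value is NOT DONE, and –SER is "Y" or "N". The Python sketch below checks those three statements for one record. The function name is hypothetical, and treating a missing –SER value as acceptable is an assumption of the sketch rather than a rule of the model.

```python
def check_event(record: dict, domain: str = "AE") -> list:
    """Return a list of value-rule violations for one Events-class record."""
    issues = []
    stat = record.get(f"{domain}STAT") or ""
    reasnd = record.get(f"{domain}REASND") or ""
    ser = record.get(f"{domain}SER") or ""

    if stat not in ("", "NOT DONE"):
        issues.append(f"{domain}STAT must be null or NOT DONE")
    if reasnd and stat != "NOT DONE":
        issues.append(f"{domain}REASND is populated but {domain}STAT is not NOT DONE")
    # A missing value is tolerated here (assumption); only other values are flagged.
    if ser not in ("", "Y", "N"):
        issues.append(f"{domain}SER must be Y or N")
    return issues


clean = check_event({"AETERM": "NAUSEA", "AESEV": "MILD", "AESER": "N"})
flawed = check_event({
    "AETERM": "NAUSEA",
    "AESTAT": "DONE",
    "AEREASND": "SUBJECT REFUSED",
    "AESER": "MAYBE",
})
```

The first record produces no issues; the second produces one issue for each of the three rules.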

2.2.3 The Findings Observation Class

Table 2.2.3: Findings — Topic and Qualifier Variables, One Record per Finding

Variable Name	Variable Label	Type	Role	Description
Topic Variable
–TESTCD	Short Name of Measurement, Test or Examination	Char	Topic	Short character value for –TEST used as a column name when converting a dataset from a vertical format to a horizontal format. The short value can be up to 8 characters. Examples: PLAT, SYSBP, RRMIN, EYEEXAM.
Qualifier Variables
–TEST	Name of Measurement, Test or Examination	Char	Synonym Qualifier of –TESTCD	Long name for –TESTCD. Examples: Platelet, Systolic Blood Pressure, Summary (Min) RR Duration, Eye Examination.
–MODIFY	Modified Term	Char	Synonym Qualifier of –ORRES	If the value of –ORRES is modified for coding purposes, then the modified text is placed here.
–TSTDTL	Measurement, Test or Examination Detail	Char	Variable Qualifier of –TESTCD and –TEST	Further description of –TESTCD and –TEST. Example: "The percentage of cells with +1 intensity of staining" when MITESTCD = TTF1.
–CAT	Category	Char	Grouping Qualifier	Used to define a category of topic-variable values. Examples: HEMATOLOGY, URINALYSIS, CHEMISTRY, HAMILTON DEPRESSION SCALE, SF36, MICRO ARRAY, EGFR MUTATION ANALYSIS.
–SCAT	Subcategory	Char	Grouping Qualifier	Used to define a further categorization of –CAT values. Example: DIFFERENTIAL.
–POS	Position of Subject During Observation	Char	Record Qualifier	Position of the subject during a measurement or examination. Examples: SUPINE, STANDING, SITTING.
–BODSYS	Body System or Organ Class	Char	Record Qualifier	Body System or Organ Class that is involved for a finding from the standard hierarchy for dictionary-coded results. Example: MedDRA SOC.
–ORRES	Result or Finding in Original Units	Char	Result Qualifier	Result of the measurement or finding as originally received or collected. Examples: 120, <1, POS.
–ORRESU	Original Units	Char	Variable Qualifier of –ORRES	Unit for –ORRES. Examples: IN, LB, kg/L.
–ORNRLO	Normal Range Lower Limit-Original Units	Char	Variable Qualifier of –ORRES	Lower end of normal range or reference range for results stored in –ORRES.
–ORNRHI	Normal Range Upper Limit-Original Units	Char	Variable Qualifier of –ORRES	Upper end of normal range or reference range for results stored in –ORRES.
–STRESC	Result or Finding in Standard Format	Char	Result Qualifier	Contains the result value for all findings, copied or derived from –ORRES in a standard format or in standard units. –STRESC should store all results or findings in character format; if results are numeric, they should also be stored in numeric format in –STRESN. For example, if various tests have results "NONE", "NEG", and "NEGATIVE" in –ORRES and these results effectively have the same meaning, they could be represented in standard format in –STRESC as "NEGATIVE".
–STRESN	Numeric Result/Finding in Standard Units	Num	Result Qualifier	Used for continuous or numeric results or findings in standard format; copied in numeric format from –STRESC. –STRESN should store all numeric test results or findings.
–STRESU	Standard Units	Char	Variable Qualifier of –STRESC and –STRESN	Standardized units used for –STRESC and –STRESN. Example: mol/L.
–STNRLO	Normal Range Lower Limit-Standard Units	Num	Variable Qualifier of –STRESC and –STRESN	Lower end of normal range or reference range for standardized results (e.g., –STRESC, –STRESN) represented in standardized units (–STRESU).
–STNRHI	Normal Range Upper Limit-Standard Units	Num	Variable Qualifier of –STRESC and –STRESN	Upper end of normal range or reference range for standardized results (e.g., –STRESC, –STRESN) represented in standardized units (–STRESU).
–STNRC	Normal Range for Character Results	Char	Variable Qualifier of –STRESC	Normal range or reference range for results stored in –STRESC that are character in ordinal or categorical scale. Example: Negative to Trace.
–NRIND	Normal/Reference Range Indicator	Char	Variable Qualifier of –ORRES	Used to indicate the value is outside the normal range or reference range. May be defined by –ORNRLO and –ORNRHI or other objective criteria. Examples: Y, N; HIGH, LOW; NORMAL; ABNORMAL.
–RESCAT	Result Category	Char	Variable Qualifier of –ORRES	Used to categorize the result of a finding. Examples: MALIGNANT or BENIGN for tumor findings; RESISTANCE VARIANT for genetic variation.
–STAT	Completion Status	Char	Record Qualifier	Used to indicate that a question was not asked or a test was not done, or a test was attempted but did not generate a result. Should be null or have a value of NOT DONE.
–REASND	Reason Not Done	Char	Record Qualifier	Reason not done. Used in conjunction with –STAT when value is NOT DONE.
–XFN	External File Path	Char	Record Qualifier	Filename for an external file, such as one for an ECG waveform or a medical image.
–NAM	Laboratory/Vendor Name	Char	Record Qualifier	Name or identifier of the vendor (e.g., laboratory) that provided the test results.
–LOINC	LOINC Code	Char	Synonym Qualifier of –TESTCD	Logical Observation Identifiers Names and Codes (LOINC) code for the topic variable such as a lab test.
–SPEC	Specimen Material Type	Char	Record Qualifier	Defines the type of specimen used for a measurement. Examples: SERUM, PLASMA, URINE, DNA, RNA.
–ANTREG	Anatomical Region	Char	Variable Qualifier of –SPEC	Defines the specific anatomical or biological region of a tissue, organ specimen or the region from which the specimen is obtained, as defined in the protocol, such as a section or part of what is described in the –SPEC variable. Examples: CORTEX, MEDULLA, MUCOSA
–SPCCND	Specimen Condition	Char	Record Qualifier	Defines the condition of the specimen. Example: CLOUDY.
–SPCUFL	Specimen Usability for the Test	Char	Record Qualifier	Describes the usability of the specimen for the test. The value will be N if the specimen is not usable, and null if the specimen is usable.
–LOC	Location Used for the Measurement	Char	Record Qualifier	Anatomical location of the subject relevant to the collection of the measurement. Examples: RECTAL for temperature, ARM for blood pressure.
–LAT	Laterality	Char	Variable Qualifier of –LOC	Qualifier for anatomical location or specimen further detailing laterality. Examples: RIGHT, LEFT, BILATERAL.
–DIR	Directionality	Char	Variable Qualifier of –LOC	Qualifier for anatomical location or specimen further detailing directionality. Examples: ANTERIOR, LOWER, PROXIMAL.
–PORTOT	Portion or Totality	Char	Variable Qualifier of –LOC	Qualifier for anatomical location or specimen further detailing the distribution, which means arrangement of, apportioning of. Examples: ENTIRE, SINGLE, SEGMENT, MANY.
–METHOD	Method of Test or Examination	Char	Record Qualifier	Method of the test or examination. Examples: EIA (Enzyme Immunoassay), ELECTROPHORESIS, DIPSTICK.
–RUNID	Run ID	Char	Record Qualifier	A unique identifier for a particular run of a test on a particular batch of samples.
–ANMETH	Analysis Method	Char	Record Qualifier	Analysis method applied to obtain a summarized result. Analysis method describes the method of secondary processing applied to a complex observation result (e.g. an image or a genetic sequence).
–LEAD	Lead Identified to Collect Measurements	Char	Record Qualifier	Lead or leads identified to capture the measurement for a test from an instrument. Examples: LEAD I, LEAD V2, LEAD CM5.
–CSTATE	Consciousness State	Char	Record Qualifier	The consciousness state of the subject at the time of measurement. Examples: CONSCIOUS, SEMI-CONSCIOUS, UNCONSCIOUS
–BLFL	Baseline Flag	Char	Record Qualifier	Indicator used to identify a baseline value. Should be Y or null.
–FAST	Fasting Status	Char	Record Qualifier	Indicator used to identify fasting status. Valid values include Y, N, U or null if not relevant.
–DRVFL	Derived Flag	Char	Record Qualifier	Used to indicate a derived record (e.g., a record that represents the average of other records such as a computed baseline). Should be Y or null.
–EVAL	Evaluator	Char	Record Qualifier	Role of the person who provided the evaluation. Used only for results that are subjective (e.g., assigned by a person or a group). Examples: ADJUDICATION COMMITTEE, INDEPENDENT ASSESSOR, RADIOLOGIST.
–EVALID	Evaluator Identifier	Char	Variable Qualifier of –EVAL	Used to distinguish multiple evaluators with the same role recorded in –EVAL. Examples: RADIOLOGIST1 or RADIOLOGIST2
–ACPTFL	Accepted Record Flag	Char	Record Qualifier	In cases where more than one assessor provides an evaluation of a result or response, this flag identifies the record that is considered, by an independent assessor, to be the accepted evaluation. Expected to be Y or null.
–TOX	Toxicity	Char	Variable Qualifier of –TOXGR	Description of toxicity quantified by –TOXGR such as NCI CTCAE Short Name. Examples: HYPERCALCEMIA, HYPOCALCEMIA. Sponsor should specify which scale and version is used in the Sponsor Comments column of the Define-XML document.
–TOXGR	Toxicity Grade	Char	Record Qualifier	Records toxicity grade using a standard toxicity scale (such as the NCI CTCAE). Sponsor should specify which scale and version is used in the Sponsor Comments column of the Define data definition document. Example: 2.
–SEV	Severity	Char	Record Qualifier	Describes the severity or intensity of a particular finding. Examples: MILD, MODERATE, SEVERE.
–DTHREL	Relationship to Death	Char	Record Qualifier	Describes the relationship of a particular finding to the death of a subject. Examples: Y, N or U.
–LLOQ	Lower Limit of Quantitation	Num	Variable Qualifier of –STRESC and –STRESN	Indicates the lower limit of quantitation for an assay. Units will be those used for –STRESU.
–ULOQ	Upper Limit of Quantitation	Num	Variable Qualifier of –STRESC and –STRESN	Indicates the upper limit of quantitation for an assay. Units will be those used for –STRESU.
–EXCLFL	Exclude from Statistics	Char	Record Qualifier	Indicates whether the values in the result variables for this record should be excluded from summary statistical calculations such as Mean, Standard Deviation, and Count (and others). Expected to be Y or null. –EXCLFL should not be used when –STAT is NOT DONE. Not to be used with human clinical trials.
–REASEX	Reason for Exclusion from Statistics	Char	Record Qualifier	Reason excluded from statistics. Used in conjunction with –EXCLFL when its value is Y. Not to be used with human clinical trials.
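
The relationship between –ORRES, –STRESC, and –STRESN described above can be illustrated with a minimal Python sketch. The synonym mapping and the function name are hypothetical, and the sketch assumes the original value is already expressed in the standard unit, so no unit conversion is shown.

```python
import re
from typing import Optional, Tuple

# Hypothetical sponsor mapping: original results that share one meaning.
SYNONYMS = {"NONE": "NEGATIVE", "NEG": "NEGATIVE", "NEGATIVE": "NEGATIVE"}

# Plain decimal numbers only; values such as "<1" remain character-only.
NUMERIC = re.compile(r"[+-]?\d+(\.\d+)?")


def standardize_result(orres: Optional[str]) -> Tuple[Optional[str], Optional[float]]:
    """Return (STRESC, STRESN) for one original result (ORRES)."""
    if orres is None or not orres.strip():
        return None, None
    text = orres.strip()
    # The character result is always populated in standard format.
    stresc = SYNONYMS.get(text.upper(), text)
    # The numeric result is populated only when the standard result is numeric.
    stresn = float(stresc) if NUMERIC.fullmatch(stresc) else None
    return stresc, stresn
```

With this sketch, "NEG" yields a character result of "NEGATIVE" and a null numeric result, while "120" yields both "120" and 120.0.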

2.2.3.1 Findings About Events or Interventions¶

Findings About Events or Interventions utilizes the Findings General Observation Class variables with the addition of the –OBJ variable as described in the following table. Note, the –OBJ variable must only be used in Findings About Events or Interventions.

Table 2.2.3.1: Findings About, Additional Qualifiers

Variable Name	Variable Label	Type	Role	Description
–OBJ	Object of the Observation	Char	Record Qualifier	Used in domains modeled as Findings About Events or Findings About Interventions. Describes the event or intervention whose property is being measured in –TESTCD/–TEST. Example: an event of vomiting which has findings, where –OBJ = "VOMIT" and the volume of VOMIT is being measured where –TESTCD = "VOLUME".

2.2.4 Identifiers for All Classes¶

All of the following Identifier variables are available for use in any domain based on one of the three general observation classes. STUDYID, DOMAIN, USUBJID (or POOLID), and –SEQ are required in all domains based on one of the three general observation classes.

All identifier variables are allowed for both SDTMIG and SENDIG.

Table 2.2.4: All Observation Classes — Identifiers

Variable Name	Variable Label	Type	Description
STUDYID	Study Identifier	Char	Unique identifier for a study.
DOMAIN	Domain Abbreviation	Char	Two-character abbreviation for the domain most relevant to the observation. The Domain abbreviation is also used as a prefix for variables to ensure uniqueness when datasets are merged.
USUBJID	Unique Subject Identifier	Char	Identifier used to uniquely identify a subject across all studies for all applications or submissions involving the product.
POOLID	Pool Identifier	Char	An identifier used to identify a result from a group of subjects that is not assignable to a specific subject.
SPDEVID	Sponsor Device Identifier	Char	Sponsor-defined identifier for a device.
–SEQ	Sequence Number	Num	Sequence number to ensure uniqueness of records within a dataset for a subject (or within a parameter, in the case of the Trial Summary domain). May be any valid number (including decimals) and does not have to start at 1.
–GRPID	Group ID	Char	Optional group identifier, used to link together a block of related records within a subject in a domain. Also used to link together a block of related records in the Trial Summary dataset (Section 3.4).
–REFID	Reference ID	Char	Optional internal or external identifier such as lab specimen ID, or UUID for an ECG waveform or a medical image.
–SPID	Sponsor-Defined Identifier	Char	Sponsor-defined identifier. Example: pre-printed line identifier on a Concomitant Medications page.
–LNKID	Link ID	Char	Identifier used to link related records across domains. This may be a one-to-one or a one-to-many relationship. For example: a single tumor may have multiple measurements/assessments performed at each study visit.
–LNKGRP	Link Group ID	Char	Identifier used to link related records across domains. This will usually be a many-to-one relationship. For example: multiple tumor measurements/assessments will contribute to a single response to therapy determination record.
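
As an illustration of the –SEQ requirement, the following Python sketch assigns a sequence number within each subject and then checks uniqueness. The function names and the dictionary-based record layout are assumptions made for the example. Numbering from 1 in input order is one possible choice, since the model allows any valid number.

```python
from collections import defaultdict
from typing import Dict, List


def assign_seq(records: List[Dict], seq_var: str) -> List[Dict]:
    """Return copies of the records with a per-subject sequence number."""
    counters = defaultdict(int)
    numbered = []
    for rec in records:
        counters[rec["USUBJID"]] += 1
        numbered.append({**rec, seq_var: counters[rec["USUBJID"]]})
    return numbered


def seq_is_unique(records: List[Dict], seq_var: str) -> bool:
    """Check that no (USUBJID, sequence number) pair occurs twice."""
    seen = set()
    for rec in records:
        key = (rec["USUBJID"], rec[seq_var])
        if key in seen:
            return False
        seen.add(key)
    return True
```

For a hypothetical Adverse Events dataset, `assign_seq(rows, "AESEQ")` would number each subject's records 1, 2, 3, and so on.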

2.2.5 Timing Variables for All Classes¶

All of the following timing variables are available for use in any domain based on one of the three general observation classes except where restricted in implementation guide standard-domain-model assumptions.

All timing variables are allowed for both SDTMIG and SENDIG.

Table 2.2.5: All Observation Classes — Timing Variables

Variable Name	Variable Label	Type	Description
VISITNUM	Visit Number	Num	Clinical encounter number. Numeric version of VISIT, used for sorting.
VISIT	Visit Name	Char	Protocol-defined description of a clinical encounter.
VISITDY	Planned Study Day of Visit	Num	Planned study day of VISIT. Should be an integer.
TAETORD	Planned Order of Element within Arm	Num	Number that gives the planned order of the Element within the Arm (see Trial Arms, Section 3.2.2).
EPOCH	Epoch	Char	Epoch associated with the start date/time of the observation, or the date/time of collection if start date/time is not collected. (See Section 3.2.2).
–DTC	Date/Time of Collection	Char	Collection date and time of an observation represented in ISO 8601 character format.
–STDTC	Start Date/Time of Observation	Char	Start date/time of an observation represented in ISO 8601 character format.
–ENDTC	End Date/Time of Observation	Char	End date/time of the observation represented in ISO 8601 character format.
–DY	Study Day of Visit/Collection/Exam	Num	Actual study day of visit/collection/exam expressed in integer days relative to the sponsor-defined RFSTDTC in Demographics.
–STDY	Study Day of Start of Observation	Num	Actual study day of start of observation expressed in integer days relative to the sponsor-defined RFSTDTC in Demographics.
–ENDY	Study Day of End of Observation	Num	Actual study day of end of observation expressed in integer days relative to the sponsor-defined RFSTDTC in Demographics.
–DUR	Duration	Char	Collected duration of an event, intervention, or finding represented in ISO 8601 character format. Used only if collected on the CRF and not derived.
–TPT	Planned Time Point Name	Char	Text description of time when a measurement or observation should be taken as defined in the protocol. This may be represented as an elapsed time relative to a fixed reference point, such as time of last dose. See –TPTNUM and –TPTREF.
–TPTNUM	Planned Time Point Number	Num	Numeric version of planned time point used in sorting.
–ELTM	Planned Elapsed Time from Time Point Ref	Char	Planned Elapsed time in ISO 8601 character format relative to a planned fixed reference (–TPTREF) such as "Previous Dose" or "Previous Meal". This variable is useful where there are repetitive measures. Not a clock time or a date/time variable, but an interval, represented as ISO duration.
–TPTREF	Time Point Reference	Char	Description of the fixed reference point referred to by –ELTM, –TPTNUM, and –TPT. Examples: PREVIOUS DOSE, PREVIOUS MEAL.
–RFTDTC	Date/Time of Reference Time Point	Char	Date/time for a fixed reference time point defined by –TPTREF in ISO 8601 character format.
–STRF	Start Relative to Reference Period	Char	Identifies the start of the observation as being before, during, or after the sponsor-defined reference period. The sponsor-defined reference period is a continuous period of time defined by a discrete starting point and a discrete ending point represented by RFSTDTC and RFENDTC in Demographics. Note: This variable will be deprecated (phased out) in a future (post SDTM v1.4) release. The functionality of this variable can be replaced by the use of –STRTPT with –STTPT = RFSTDTC.
–ENRF	End Relative to Reference Period	Char	Identifies the end of the observation as being before, during or after the sponsor-defined reference period. The sponsor-defined reference period is a continuous period of time defined by a discrete starting point and a discrete ending point represented by RFSTDTC and RFENDTC in Demographics. Note: This variable will be deprecated (phased out) in a future (post SDTM v1.4) release. The functionality of this variable can be replaced by the use of –ENRTPT with –ENTPT = RFENDTC.
–EVLINT	Evaluation Interval	Char	Duration of interval associated with an observation such as a finding –TESTCD, represented in ISO 8601 character format. Example: -P2M to represent a period of the past 2 months as the evaluation interval for a question from a questionnaire such as SF-36.
–EVINTX	Evaluation Interval Text	Char	Evaluation interval associated with an observation, where the interval is not able to be represented in ISO 8601 format. Examples: LIFETIME, LAST NIGHT, RECENTLY, OVER THE LAST FEW WEEKS.
–STRTPT	Start Relative to Reference Time Point	Char	Identifies the start of the observation as being before or after the sponsor-defined reference time point defined by variable –STTPT.
–STTPT	Start Reference Time Point	Char	Description or date/time in ISO 8601 or other character format of the sponsor-defined reference point referred to by –STRTPT. Examples: "2003-12-15" or "VISIT 1".
–ENRTPT	End Relative to Reference Time Point	Char	Identifies the end of the observation as being before or after the sponsor-defined reference time point defined by variable –ENTPT.
–ENTPT	End Reference Time Point	Char	Description or date/time in ISO 8601 or other character format of the sponsor-defined reference point referred to by –ENRTPT. Examples: "2003-12-25" or "VISIT 2".
–STINT	Planned Start of Assessment Interval	Char	The start of a planned evaluation or assessment interval in ISO 8601 character format relative to the Time Point Reference (–TPTREF).
–ENINT	Planned End of Assessment Interval	Char	The end of a planned evaluation or assessment interval in ISO 8601 character format relative to the Time Point Reference (–TPTREF).
–DETECT	Time in Days to Detection	Num	The number of days from the start of dosing to the earliest detection of a condition or pathogen.
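
The study day variables (–DY, –STDY, –ENDY) are integer days relative to RFSTDTC. The Python sketch below shows one way to compute them. It assumes the convention used in the implementation guides, which is not stated in the table above: the reference date itself is day 1, earlier dates are negative, and there is no day 0. The function name is hypothetical, and partial dates are treated as not computable.

```python
from datetime import date
from typing import Optional


def study_day(dtc: Optional[str], rfstdtc: Optional[str]) -> Optional[int]:
    """Return the study day of dtc relative to rfstdtc, or None."""
    # A study day needs a complete date (YYYY-MM-DD) on both sides.
    if not dtc or not rfstdtc or len(dtc) < 10 or len(rfstdtc) < 10:
        return None
    # Any time part after the date is ignored.
    observed = date.fromisoformat(dtc[:10])
    reference = date.fromisoformat(rfstdtc[:10])
    delta = (observed - reference).days
    # Assumed convention: no day 0, so on-or-after dates are shifted by one.
    return delta + 1 if delta >= 0 else delta
```

Under these assumptions an observation on the reference date falls on day 1, and an observation on the preceding date falls on day -1.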

2.2.6 The Demographics Domain¶

Each study must include one standardized set of observations in a specific structure; this is the Demographics domain described in Table 2.2.6. Demographics is the parent domain for all other observations for subjects, and should be identified with the domain code of "DM". The Demographics domain describes the essential characteristics of the study subjects, and is used by reviewers for selecting subsets of subjects for analysis. The Demographics domain, as with other datasets, includes Identifiers, a Topic variable, Timing variables, and Qualifiers. Since DM has a fixed structure, only certain variables may be added as appropriate. See the implementation guides for guidance on which additional variables can be added to this domain.

Table 2.2.6: Subject Demographics Domain Variables

Variable Name	Variable Label	Type	Description
Identifier Variables
STUDYID	Study Identifier	Char	Unique identifier for a study.
DOMAIN	Domain Abbreviation	Char	Two-character abbreviation for the domain which must be DM.
USUBJID	Unique Subject Identifier	Char	Identifier used to uniquely identify a subject across all studies for all applications or submissions involving the product.
Topic Variables
SUBJID	Subject Identifier for the Study	Char	Subject identifier, which must be unique within the study. Often the ID of the subject as recorded on a CRF.
Qualifier Variables
RFSTDTC	Subject Reference Start Date/Time	Char	Reference Start Date/time for the subject in ISO 8601 character format. Usually equivalent to date/time when subject was first exposed to study treatment. Required for all randomized subjects; will be null for all subjects who did not meet the milestone the date requires, such as screen failures or unassigned subjects.
RFENDTC	Subject Reference End Date/Time	Char	Reference End Date/time for the subject in ISO 8601 character format. Usually equivalent to the date/time when subject was determined to have ended the trial, and often equivalent to date/time of last exposure to study treatment. Required for all randomized subjects; null for screen failures or unassigned subjects.
RFXSTDTC	Date/Time of First Study Treatment	Char	First date/time of exposure to any protocol-specified treatment or therapy for the subject in ISO 8601 character format.
RFXENDTC	Date/Time of Last Study Treatment	Char	Last date/time of exposure to any protocol-specified treatment or therapy for the subject in ISO 8601 character format.
RFICDTC	Date/Time of Informed Consent	Char	Date/time of informed consent in ISO 8601 character format.
RFPENDTC	Date/Time of End of Participation	Char	Date/time when subject ended participation or follow-up in a trial, in ISO 8601 character format. Should correspond to the last known date of contact.
DTHDTC	Date/Time of Death	Char	Date/time of death for any subject who died, in ISO 8601 format. Should represent the date/time that is captured in the clinical-trial database.
DTHFL	Subject Death Flag	Char	A value of 'Y' indicates the subject died. Should be Y or null. Should be populated even when the death date is unknown.
SITEID	Study Site Identifier	Char	Unique identifier for a site within a study.
INVID	Investigator Identifier	Char	An identifier to describe the Investigator for the study. May be used in addition to the SITEID. Not needed if SITEID is equivalent to INVID.
INVNAM	Investigator Name	Char	Name of the investigator for a site.
BRTHDTC	Date/Time of Birth	Char	Date/time of birth of the subject in ISO 8601 character format.
AGE	Age	Num	Age expressed in AGEU. May be derived as (RFSTDTC-BRTHDTC), but BRTHDTC may not be available in all cases (due to subject privacy concerns).
AGETXT	Age Text	Char	The age of the subject at study start, as planned, expressed as a range. If an age integer value is available, then populate the age variable instead. Either the AGE or AGETXT variable should be populated, but not both.
AGEU	Age Units	Char	Units associated with AGE or AGETXT.
SEX	Sex	Char	Sex of the subject.
RACE	Race	Char	Race of the subject. Sponsors should refer to "Collection of Race and Ethnicity Data in Clinical Trials" (FDA, September 2005) for guidance regarding the collection of race (http://www.fda.gov/downloads/RegulatoryInformation/Guidances/ucm126396.pdf)
ETHNIC	Ethnicity	Char	The ethnicity of the subject. Sponsors should refer to "Collection of Race and Ethnicity Data in Clinical Trials" (FDA, September 2005) for guidance regarding the collection of ethnicity (http://www.fda.gov/downloads/RegulatoryInformation/Guidances/ucm126396.pdf)
SPECIES	Species	Char	Used to identify the common species name of the subject (i.e., test system) under study (e.g., MOUSE, RAT, DOG, MONKEY).
STRAIN	Strain/Substrain	Char	Used to identify the vendor-supplied strain/substrain designation for the subject (i.e., test system) under study. When applicable, it combines the root strain, substrain, and associated genetic modifications, as supplied by the vendor (e.g., C57BL/6, A/J, B6.129-Pparg<tm2Rev>/J, FISCHER 344, SPRAGUE DAWLEY IGS, WISTAR Kyoto, BEAGLE, CYNOMOLGUS, CHIMPANZEE).
SBSTRAIN	Strain/Substrain Details	Char	Free-text field that allows the sponsor to enter additional details regarding the subject (i.e. test system) under study, such as a description of a specific genetic alteration.
ARMCD	Planned Arm Code	Char	Short name for the Arm to which the subject was assigned, limited to 20 characters.
ARM	Description of Planned Arm	Char	Name of the Arm to which the subject was assigned.
ACTARMCD	Actual Arm Code	Char	Short name for the actual Arm in which the subject participated during the trial, limited to 20 characters.
ACTARM	Description of Actual Arm	Char	Description of the actual Arm in which the subject participated during the trial.
SETCD	Set Code	Char	Short name of a specific Trial Set (see Table 3.2.4), as defined by the sponsor. Maximum of 8 characters. This represents the code for the Trial Set for which parameters are being submitted.
COUNTRY	Country	Char	Country of the investigational site at which the subject participated in the trial in ISO 3166 three-character format.
Timing Variables
DMDTC	Date/Time of Collection	Char	Date/time of collection of the demographic information in ISO 8601 character format.
DMDY	Study Day of Collection	Num	Study day of collection measured as integer days. Algorithm for calculations must be relative to the sponsor-defined RFSTDTC in Demographics.
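
The AGE description notes that age may be derived as (RFSTDTC-BRTHDTC). A minimal Python sketch of that derivation follows. It assumes AGEU is YEARS and that age is counted in completed years, and it returns nothing when either date is missing or partial (for example, when BRTHDTC is withheld for privacy reasons). The function name is hypothetical.

```python
from datetime import date
from typing import Optional


def age_in_years(brthdtc: Optional[str], rfstdtc: Optional[str]) -> Optional[int]:
    """Return completed years between birth and reference start, or None."""
    if not brthdtc or not rfstdtc or len(brthdtc) < 10 or len(rfstdtc) < 10:
        return None
    birth = date.fromisoformat(brthdtc[:10])
    reference = date.fromisoformat(rfstdtc[:10])
    years = reference.year - birth.year
    # Subtract one if the birthday has not yet occurred in the reference year.
    if (reference.month, reference.day) < (birth.month, birth.day):
        years -= 1
    return years
```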

2.2.7 The Comments Domain¶

Comments are collected during the conduct of many studies. These are normally supplied by a principal investigator, but might also be collected from other sources such as central reviewers. When collected, comments should be submitted in a single Comments domain, which is defined in Table 2.2.7.

Please see Implementation Guide for further guidance regarding use of additional Identifier and Timing variables.

Table 2.2.7: Comments Domain Variables

Variable Name	Variable Label	Type	Role	Description
STUDYID	Study Identifier	Char	Identifier	Unique identifier for a study.
DOMAIN	Domain Abbreviation	Char	Identifier	Two-character abbreviation for the domain which must be CO.
RDOMAIN	Related Domain Abbreviation	Char	Record Qualifier	Domain Abbreviation of the parent record(s). Null for records collected on general comments or additional information CRF page.
USUBJID	Unique Subject Identifier	Char	Identifier	Identifier used to uniquely identify a subject across all studies for all applications or submissions involving the product.
POOLID	Pool Identifier	Char	Identifier	Used to identify a result for pooled subjects that is not assignable to any one individual within the pool.
COSEQ	Sequence Number	Num	Identifier	Sequence number to ensure uniqueness within a domain. Should be assigned to be in a consistent chronological order.
IDVAR	Identifying Variable	Char	Record Qualifier	Identifying variable in the parent dataset that identifies the record(s) to which the comment applies. Examples: AESEQ or CMGRPID. Used only when individual comments are related to domain records. Null for comments collected on separate CRFs.
IDVARVAL	Identifying Variable Value	Char	Record Qualifier	Value of identifying variable of the parent record(s). Null for comments collected on separate CRFs.
COREF	Comment Reference	Char	Record Qualifier	Sponsor-defined reference associated with the comment. May be the CRF page number (e.g. 650), or a module name (e.g. DEMOG), or a combination of information that identifies the reference (e.g. 650-VITALS-VISIT 2).
COVAL	Comment	Char	Topic	The text of the comment. Text over 200 characters can be added to additional columns COVAL1-COVALn.
COEVAL	Evaluator	Char	Record Qualifier	Used to describe the originator of the comment. Example: CENTRAL REVIEWER.
CODTC	Date/Time of Comment	Char	Timing	Date/time of comment on dedicated comment form, if collected. Represented in ISO 8601 character format. Should be null if this is a child record of another domain or if comment date was not collected.
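
The COVAL description states that text over 200 characters can be carried in additional columns COVAL1-COVALn. The Python sketch below shows one possible way to split a comment. Breaking at word boundaries with the standard `textwrap` module is an assumption made for the example, as the model does not prescribe where the text is divided. The function name is hypothetical.

```python
import textwrap
from typing import Dict


def split_comment(text: str, width: int = 200) -> Dict[str, str]:
    """Split comment text into COVAL, COVAL1, ..., COVALn."""
    # textwrap.wrap breaks at whitespace and keeps each chunk within width.
    chunks = textwrap.wrap(text, width=width) or [""]
    columns = {"COVAL": chunks[0]}
    for index, chunk in enumerate(chunks[1:], start=1):
        columns[f"COVAL{index}"] = chunk
    return columns
```

A comment of 200 characters or fewer yields only COVAL; longer text continues in COVAL1, COVAL2, and so on.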

2.2.8 The Subject Elements Table¶

The Subject Elements table describes the actual order of Elements that were traversed by the subject, together with the start date/time and end date/time for each Element. These correspond to the planned Elements described in the Trial Elements (Section 3.2.1) of the Trial Design Model. Because actual data does not always follow the plan, the model allows for descriptions of an unplanned Element for subjects.

Please see Implementation Guide for further guidance regarding use of additional Identifier and Timing variables.

Table 2.2.8 Subject Elements: One Record per Actual Element per Subject

Variable Name	Variable Label	Type	Description
STUDYID	Study Identifier	Char	Unique identifier for a study.
DOMAIN	Domain Abbreviation	Char	Two-character abbreviation for the domain which must be SE.
USUBJID	Unique Subject Identifier	Char	Identifier used to uniquely identify a subject across all studies for all applications or submissions involving the product.
SESEQ	Sequence Number	Num	Sequence number to ensure uniqueness within dataset. Should be assigned to be in a consistent chronological order.
Topic Variable
ETCD	Element Code	Char	ETCD (the companion to ELEMENT) is limited to 8 characters and does not have special character restrictions. These values should be short for ease of use in programming, but it is not expected that ETCD will need to serve as a variable name.
Qualifier Variables
ELEMENT	Description of Element	Char	The name of the Element. If ETCD has a value of "UNPLAN" then ELEMENT should be null.
Timing Variables
SESTDTC	Start Date/Time of Element	Char	Start date/time for an Element for each subject, represented in ISO 8601 character format.
SEENDTC	End Date/Time of Element	Char	End date/time of an Element for each subject, represented in ISO 8601 character format.
TAETORD	Planned Order of Element within Arm	Num	Number that gives the planned order of the Element within the subject's assigned ARM.
EPOCH	Epoch	Char	Epoch associated with the Element in the planned sequence of Elements for the ARM to which the subject was assigned.
Qualifier Variables
SEUPDES	Description of Unplanned Element	Char	Description of what happened to the subject during an unplanned Element. Used only if ETCD has the value of "UNPLAN".
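
The rules stated in the table for ETCD, ELEMENT, and SEUPDES can be checked record by record. The following Python sketch is a hypothetical consistency check, not an official validation rule set. The function name, messages, and dictionary-based record layout are assumptions.

```python
from typing import Dict, List


def check_se_record(rec: Dict) -> List[str]:
    """Return a list of problems found in one Subject Elements record."""
    problems = []
    etcd = rec.get("ETCD") or ""
    if len(etcd) > 8:
        problems.append("ETCD is longer than 8 characters")
    if etcd == "UNPLAN":
        # Unplanned Elements have no planned name.
        if rec.get("ELEMENT"):
            problems.append("ELEMENT should be null when ETCD is UNPLAN")
    elif rec.get("SEUPDES"):
        # The unplanned description applies only to unplanned Elements.
        problems.append("SEUPDES is used only when ETCD is UNPLAN")
    return problems
```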

2.2.9 The Subject Visits Table¶

The Subject Visits table describes the actual start and end date/time for each visit of each individual subject. These correspond to the planned visits described in the Trial Design Model Trial Visits table (see Section 3.2.3). Because actual data does not always follow the plan, the model allows for descriptions of unplanned visits for subjects.

Please see Implementation Guide for further guidance regarding use of additional Identifier and Timing variables.

Table 2.2.9 Subject Visits: One Record per Subject Visit, per Subject

Variable Name	Variable Label	Type	Description
STUDYID	Study Identifier	Char	Unique identifier for a study.
DOMAIN	Domain Abbreviation	Char	Two-character abbreviation for the domain, which must be SV.
USUBJID	Unique Subject Identifier	Char	Identifier used to uniquely identify a subject across all studies for all applications or submissions involving the product.
Topic Variable
VISITNUM	Visit Number	Num	Clinical encounter number. (Decimal numbering may be useful for inserting unplanned visits.) Numeric version of VISIT, used for sorting.
Timing Variables
VISIT	Visit Name	Char	Protocol-defined description of clinical encounter or description of unplanned visit. May be used in addition to VISITNUM and/or VISITDY as a text description of the clinical encounter.
VISITDY	Planned Study Day of Visit	Num	Planned study day of the start of the visit based upon RFSTDTC in Demographics.
SVSTDTC	Start Date/Time of Visit	Char	Start date/time for a subject's visit, represented in ISO 8601 character format.
SVENDTC	End Date/Time of Visit	Char	End date/time of a subject's visit, represented in ISO 8601 character format.
SVSTDY	Study Day of Start of Visit	Num	Study day of start of visit relative to the sponsor-defined RFSTDTC.
SVENDY	Study Day of End of Visit	Num	Study day of end of visit relative to the sponsor-defined RFSTDTC.
Qualifier Variables
SVUPDES	Description of Unplanned Visit	Char	Description of what happened to the subject during an unplanned visit. Null for protocol-defined visits.

3 The Trial Design Model¶

3.1 Introduction¶

The Trial Design Model defines a standard structure for representing the planned sequence of events and the treatment plan for the trial. The model provides a standard way to define the treatment groups and planned visits and assessments that will be experienced by trial subjects.

The model is built upon the concepts of Elements, Arms, Epochs, and Visits. The variables corresponding to these concepts are used in many domains. The implementation guides define specific details and examples for Trial Design.

3.2 Planned Elements, Arms, and Visits¶

Under the model, planned information is presented in a series of four tables:

•The Trial Elements table (Table 3.2.1) describes the Element code (unique for each Element), the Element description, and the rules for starting and ending an Element. A rule could be expressed as pseudo code or as executable code for determining transitions from one Element to another.

•The Trial Arms table (Table 3.2.2) describes each planned Arm in the trial. An Arm is described as an ordered sequence of Elements, and the same Element may occur more than once in a given Arm. In order to accommodate complex Trial Designs, this table allows for rules for branching from one Element to another when a choice is available, and a rule for transitions to allow a subject to skip ahead to another Element rather than proceed linearly.

•The Trial Visits table (Table 3.2.3) describes the planned order and number of visits in the study. In the case when visits vary for each Arm, there would be a separate record per Visit per Arm. It describes the allowable or planned values for VISIT, VISITNUM and VISITDY in the trial (which are subsequently used as Timing Variables for the collected study data), and rules for starting and ending each visit. In most blinded trials, the timing of visits is the same for all subjects in all Arms.

•The Trial Sets table (TX) allows the submission of detailed information about planned groups of subjects that result as a combination of experimental factors of interest for a study (including experimental parameters, inherent characteristics, and sponsor-defined attributes). A Set may be a planned subdivision of a Trial Arm, or may consist of one or more Trial Arms. These datasets are essential to determine whether data comparisons are feasible across different studies.

3.2.1 Trial Elements Table¶

Table 3.2.1 Trial Elements — One Record per Trial Element

Variable Name	Variable Label	Type	Description
STUDYID	Study Identifier	Char	Unique identifier for a study.
DOMAIN	Domain Abbreviation	Char	Two-character abbreviation for the domain, which must be TE.
ETCD	Element Code	Char	ETCD (the companion to ELEMENT) is limited to 8 characters and does not have special character restrictions. These values should be short for ease of use in programming, but it is not expected that ETCD will need to serve as a variable name.
ELEMENT	Description of Element	Char	The name of the Element.
TESTRL	Rule for Start of Element	Char	Expresses the rule for beginning the Element.
TEENRL	Rule for End of Element	Char	Expresses the rule for ending the Element. Either TEENRL or TEDUR must be present for each Element.
TEDUR	Planned Duration of Element	Char	Planned Duration of Element in ISO 8601 format. Used when the rule for ending the Element is applied after a fixed duration.
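
The constraints stated in Table 3.2.1 lend themselves to a mechanical check. The following is a minimal sketch in Python, assuming each Trial Elements record is held as a dictionary keyed by variable name and that an empty string stands for a null value. The function name check_te_record and the sample records are hypothetical and are not part of the SDTM.

```python
def check_te_record(rec):
    """Return a list of problems found in one Trial Elements (TE) record."""
    problems = []
    if rec.get("DOMAIN") != "TE":
        problems.append("DOMAIN must be TE")
    # ETCD is limited to 8 characters.
    if len(rec.get("ETCD") or "") > 8:
        problems.append("ETCD is longer than 8 characters")
    # Either TEENRL or TEDUR must be present for each Element.
    if not rec.get("TEENRL") and not rec.get("TEDUR"):
        problems.append("either TEENRL or TEDUR must be present")
    return problems


# Hypothetical records, for illustration only.
screen = {"STUDYID": "XYZ", "DOMAIN": "TE", "ETCD": "SCRN", "ELEMENT": "Screen",
          "TESTRL": "Informed consent", "TEENRL": "", "TEDUR": "P14D"}
bad = {"STUDYID": "XYZ", "DOMAIN": "TE", "ETCD": "TREATMENT", "ELEMENT": "Treatment",
       "TESTRL": "First dose", "TEENRL": "", "TEDUR": ""}

print(check_te_record(screen))
print(check_te_record(bad))
```

The first record passes because TEDUR is populated even though TEENRL is empty; the second fails both the ETCD length check and the end-rule check.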

3.2.2 Trial Arms Table¶

Table 3.2.2 Trial Arms — One Record per Planned Element per Arm

Variable Name	Variable Label	Type	Description
STUDYID	Study Identifier	Char	Unique identifier for a study.
DOMAIN	Domain Abbreviation	Char	Two-character abbreviation for the domain, which must be TA.
ARMCD	Planned Arm Code	Char	ARMCD is limited to 20 characters and does not have special character restrictions. If the timing of visits for a trial does not depend on which ARM a subject is in, then ARMCD should be null.
ARM	Description of Planned Arm	Char	Name given to Arm or treatment group.
TAETORD	Planned Order of Element within Arm	Num	Number that gives the order of the Element within the Arm.
ETCD	Element Code	Char	ETCD (the companion to ELEMENT) is limited to 8 characters and does not have special character restrictions. These values should be short for ease of use in programming, but it is not expected that ETCD will need to serve as a variable name.
ELEMENT	Description of Element	Char	The name of the Element.
TABRANCH	Branch	Char	Condition subjects meet, at a "branch" in the Trial Design at the end of this Element, to be included in this Arm. Example: Randomization to DRUG X.
TATRANS	Transition Rule	Char	If the trial design allows a subject to transition to an Element other than the next Element in sequence, then the conditions for transitioning to those other Elements, and the alternative Element sequences, are specified in this rule (e.g., Responders go to washout).
EPOCH	Epoch	Char	Name of the Trial Epoch with which this Element of the Arm is associated.

Note: The same Element may occur more than once within an Arm, but each occurrence would have a different value for TAETORD and EPOCH, and may have different values for TABRANCH and TATRANS.
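
As an illustration of the note above, the sketch below orders the Elements of each Arm by TAETORD and rejects an Arm in which two records share the same TAETORD. It is a sketch only: the function name arm_sequences, the Arm code, and the Element and Epoch values are hypothetical, and records are assumed to be dictionaries keyed by variable name.

```python
from collections import defaultdict


def arm_sequences(ta_records):
    """Map each ARMCD to its list of ETCD values ordered by TAETORD."""
    by_arm = defaultdict(list)
    for rec in ta_records:
        by_arm[rec["ARMCD"]].append(rec)

    sequences = {}
    for armcd, recs in by_arm.items():
        orders = [r["TAETORD"] for r in recs]
        # Each occurrence of an Element within an Arm needs its own TAETORD.
        if len(orders) != len(set(orders)):
            raise ValueError(f"duplicate TAETORD in Arm {armcd}")
        ordered = sorted(recs, key=lambda r: r["TAETORD"])
        sequences[armcd] = [r["ETCD"] for r in ordered]
    return sequences


# Hypothetical crossover Arm in which the WASH Element occurs twice.
ta = [
    {"ARMCD": "AB", "TAETORD": 2, "ETCD": "A", "EPOCH": "TREATMENT 1"},
    {"ARMCD": "AB", "TAETORD": 1, "ETCD": "SCRN", "EPOCH": "SCREENING"},
    {"ARMCD": "AB", "TAETORD": 3, "ETCD": "WASH", "EPOCH": "WASHOUT 1"},
    {"ARMCD": "AB", "TAETORD": 4, "ETCD": "B", "EPOCH": "TREATMENT 2"},
    {"ARMCD": "AB", "TAETORD": 5, "ETCD": "WASH", "EPOCH": "WASHOUT 2"},
]

print(arm_sequences(ta))
```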

3.2.3 Trial Visits Table¶

Table 3.2.3 Trial Visits — One Record per Planned Trial Visit

Variable Name	Variable Label	Type	Description
STUDYID	Study Identifier	Char	Unique identifier for a study.
DOMAIN	Domain Abbreviation	Char	Two-character abbreviation for the domain, which must be TV.
VISITNUM	Visit Number	Num	Clinical encounter number. Numeric version of VISIT, which can be used for sorting.
VISIT	Visit Name	Char	Protocol-defined description of the clinical encounter. May be used in addition to VISITNUM and/or VISITDY as a text description of the clinical encounter.
VISITDY	Planned Study Day of Visit	Num	Planned study day of VISIT. Due to its sequential nature, can be used for sorting.
ARMCD	Planned Arm Code	Char	ARMCD is limited to 20 characters and does not have special character restrictions. If the timing of visits for a trial does not depend on which ARM a subject is in, then ARMCD should be null.
ARM	Description of Planned Arm	Char	Name given to Arm or treatment group.
TVSTRL	Visit Start Rule	Char	Rule describing when the visit starts, in relation to the sequence of Elements.
TVENRL	Visit End Rule	Char	Rule describing when the visit ends, in relation to the sequence of Elements.

3.2.4 Trial Sets Table¶

Table 3.2.4 Trial Sets — One Record per Trial Set Parameter

Variable Name	Variable Label	Type	Description
STUDYID	Study Identifier	Char	Unique identifier for a study.
DOMAIN	Domain Abbreviation	Char	Two-character abbreviation for the domain, which must be TX.
SETCD	Set Code	Char	Short name of a specific Trial Set, as defined by the sponsor. Maximum 8 characters. This represents the Trial Set for which parameters are being submitted.
SET	Set Description	Char	Long description of a specific Trial Set, as defined by the sponsor.
TXSEQ	Sequence Number	Num	Unique number for this record within this dataset.
TXPARMCD	Trial Set Parameter Short Name	Char	Short character value for the Trial Set parameter described in TXPARM. Maximum 8 characters.
TXPARM	Trial Set Parameter	Char	Term for the Trial Set parameter. Maximum 40 characters.
TXVAL	Trial Set Parameter Value	Char	Value of the Trial Set parameter (e.g., Fed ad libitum or Restricted Feeding when TXPARM is FEEDREG). Some parameters may be subject to controlled terminology.

3.3 Trial Inclusion/Exclusion Criteria¶

The Trial Inclusion/Exclusion Domain (TI) contains one record for each of the inclusion and exclusion criteria for the trial.

3.3.1 Trial Inclusion/Exclusion Table¶

Table 3.3.1 Trial Inclusion/Exclusion - One Record per Trial Inclusion or Exclusion Criterion

Variable Name	Variable Label	Type	Description
STUDYID	Study Identifier	Char	Unique identifier for a study.
DOMAIN	Domain Abbreviation	Char	Two-character abbreviation for the domain, which must be TI.
IETESTCD	Inclusion/Exclusion Criterion Short Name	Char	Short name of IETEST. It can be used as a column name when converting a dataset from a vertical to a horizontal format. The value in IETESTCD cannot be longer than 8 characters, nor can it start with a number (e.g., "1TEST"). IETESTCD cannot contain characters other than letters, numbers, or underscores. The prefix "IE" is used to ensure consistency with the IE domain.
IETEST	Inclusion/Exclusion Criterion	Char	Full text of the inclusion or exclusion criterion. The prefix "IE" is used to ensure consistency with the IE domain.
IECAT	Inclusion/Exclusion Category	Char	Used for categorization of the Inclusion/Exclusion Criterion: INCLUSION, EXCLUSION.
IESCAT	Inclusion/Exclusion Subcategory	Char	A further categorization of the exception criterion. Can be used to distinguish criteria for a sub-study or to categorize exceptions as major or minor. Examples: MAJOR, MINOR.
TIRL	Inclusion/Exclusion Criterion Rule	Char	Rule that expresses the criterion in computer-executable form.
TIVERS	Protocol Criteria Versions	Char	The number of this version of the Inclusion/Exclusion criteria. May be omitted if there is only one version.
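
The naming rules given for IETESTCD (no more than 8 characters, no leading number, and only letters, numbers, or underscores) can be expressed as a single regular expression. The sketch below is one possible reading of those rules, assuming unaccented ASCII letters; the function name is_valid_short_name and the sample values are hypothetical.

```python
import re

# One leading letter or underscore, then up to 7 more letters, digits, or underscores.
SHORT_NAME = re.compile(r"[A-Za-z_][A-Za-z0-9_]{0,7}")


def is_valid_short_name(value):
    """True when value satisfies the short-name rules described for IETESTCD."""
    return SHORT_NAME.fullmatch(value) is not None


for candidate in ["INCL01", "1TEST", "INCLUSION1", "IN-01"]:
    print(candidate, is_valid_short_name(candidate))
```

Of the four candidates, only INCL01 passes: 1TEST starts with a number, INCLUSION1 has 10 characters, and IN-01 contains a hyphen.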

3.4 Trial Summary Information¶

The Trial Summary Information Domain (TS) contains one record for each trial summary characteristic. Trial Summary is used to record basic information about the trial, such as trial phase, protocol title and design objectives.

3.4.1 Trial Summary Table¶

Table 3.4.1 Trial Summary — One Record per Trial Summary Parameter

Variable Name	Variable Label	Type	Description
STUDYID	Study Identifier	Char	Unique identifier for a study.
DOMAIN	Domain Abbreviation	Char	Two-character abbreviation for the domain, which must be TS.
TSSEQ	Sequence Number	Num	Sequence number to ensure uniqueness within the dataset.
TSGRPID	Group ID	Char	Used to tie together a group of related records.
TSPARMCD	Trial Summary Parameter Short Name	Char	TSPARMCD (the companion to TSPARM) is limited to 8 characters and does not have special character restrictions. These values should be short for ease of use in programming, but it is not expected that TSPARMCD will need to serve as variable names. Examples: AGEMIN, AGEMAX.
TSPARM	Trial Summary Parameter	Char	Term for the Trial Summary Parameter. The value in TSPARM cannot be longer than 40 characters. Examples: Planned Minimum Age of Subjects, Planned Maximum Age of Subjects.
TSVAL	Parameter Value	Char	Value of TSPARM. Example: "ASTHMA" when TSPARM value is "Trial Indications". If TSVAL is null, a value is required for TSVALNF. Text over 200 characters can be added to additional columns TSVAL1-TSVALn.
TSVALNF	Parameter Null Flavor	Char	Null flavor for the value of TSVAL describing the reason the value is null, to be populated if and only if TSVAL is null.
TSVALCD	Parameter Value Code	Char	Code of the term in TSVAL from Reference Terminology cited in TSVCDREF.
TSVCDREF	Name of the Reference Terminology	Char	The name of the Reference Terminology from which TSVALCD is taken. For example, CDISC, SNOMED, ISO 8601.
TSVCDVER	Version of the Reference Terminology	Char	The version number of the Reference Terminology cited in TSVCDREF, if applicable.
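
Taken together, the descriptions of TSVAL and TSVALNF imply that exactly one of the two is populated in any Trial Summary record. The sketch below checks that condition, assuming records are dictionaries keyed by variable name and that an empty string stands for a null value. The function name tsval_consistent and the sample values are hypothetical.

```python
def tsval_consistent(rec):
    """True when exactly one of TSVAL and TSVALNF is populated."""
    has_value = bool(rec.get("TSVAL"))
    has_null_flavor = bool(rec.get("TSVALNF"))
    return has_value != has_null_flavor


# Hypothetical records, for illustration only.
records = [
    {"TSPARMCD": "AGEMIN", "TSVAL": "P18Y", "TSVALNF": ""},   # value only
    {"TSPARMCD": "AGEMAX", "TSVAL": "", "TSVALNF": "NA"},     # null flavor only
    {"TSPARMCD": "AGEMAX", "TSVAL": "", "TSVALNF": ""},       # neither: inconsistent
    {"TSPARMCD": "AGEMIN", "TSVAL": "P18Y", "TSVALNF": "NA"}, # both: inconsistent
]

for rec in records:
    print(rec["TSPARMCD"], tsval_consistent(rec))
```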

3.5 Trial Disease Assessments¶

The TD domain provides information on the planned protocol-specified disease assessment schedule. In oncology studies, good compliance with the disease-assessment schedule is essential to reduce the risk of 'assessment time bias'. The TD domain makes possible an evaluation of 'assessment time bias' from SDTM-based datasets, by allowing a comparison of the planned schedule of assessments against the actual occurrence of the efficacy assessments in order to determine the degree of compliance. TD has limited utility within oncology and was developed specifically with Response Evaluation Criteria in Solid Tumors (RECIST) in mind, and in particular, for studies with progression-free survival (PFS) endpoints where an assessment time bias analysis is appropriate.

3.5.1 Trial Disease Assessments Table¶

Table 3.5.1 Trial Disease Assessments - One Record per Planned Constant Assessment Period

Variable Name	Variable Label	Type	Description
STUDYID	Study Identifier	Char	Unique identifier for a study.
DOMAIN	Domain Abbreviation	Char	Two-character abbreviation for the domain, which must be TD.
TDORDER	Sequence of Planned Assessment Schedule	Num	A number given to ensure ordinal sequencing of the planned assessment schedules within a trial.
TDANCVAR	Anchor Variable Name	Char	A reference to the date variable name that provides the start point from which the planned disease assessment schedule is measured. This must be referenced from the ADaM ADSL dataset (e.g., ANCH1DT). Note: TDANCVAR is to contain the name of a reference date variable.
TDSTOFF	Offset from the Anchor	Char	A fixed offset from the date provided by the variable referenced in TDANCVAR. This is used when the timing of planned cycles does not start on the exact day referenced in the variable indicated in TDANCVAR. The value of this variable will be either zero or a positive value and will be represented in ISO 8601 character format.
TDTGTPAI	Planned Assessment Interval	Char	The planned interval between disease assessments represented in ISO 8601 character format.
TDMINPAI	Planned Assessment Interval Minimum	Char	The lower limit of the allowed range for the planned interval between disease assessments represented in ISO 8601 character format.
TDMAXPAI	Planned Assessment Interval Maximum	Char	The upper limit of the allowed range for the planned interval between disease assessments represented in ISO 8601 character format.
TDNUMRPT	Maximum Number of Actual Assessments	Num	This variable must represent the maximum number of actual assessments for the analysis that this disease assessment schedule describes. In a trial where the maximum number of assessments is not defined explicitly in the protocol (e.g. assessments occur until death) TDNUMRPT should represent the maximum number of disease assessments that support the efficacy analysis, encountered by any subject across the trial at that point in time.
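
To show how the TD variables could drive a comparison of planned against actual assessments, the sketch below expands one TD record into planned dates. It rests on several assumptions that this document does not state: durations are limited to the simple ISO 8601 forms PnD and PnW, the first assessment falls one interval after the anchor date plus TDSTOFF, and the allowed window for each assessment is measured from the previous planned assessment. The function names, the anchor date, and the record values are hypothetical.

```python
import re
from datetime import date, timedelta


def parse_duration(text):
    """Parse a simple ISO 8601 duration of the form PnD or PnW (assumed subset)."""
    match = re.fullmatch(r"P(\d+)([DW])", text)
    if match is None:
        raise ValueError(f"unsupported duration: {text}")
    count, unit = int(match.group(1)), match.group(2)
    return timedelta(days=count) if unit == "D" else timedelta(weeks=count)


def planned_assessments(anchor, td):
    """Return (target, earliest, latest) dates for each planned assessment."""
    start = anchor + parse_duration(td["TDSTOFF"])
    target = parse_duration(td["TDTGTPAI"])
    lower = parse_duration(td["TDMINPAI"])
    upper = parse_duration(td["TDMAXPAI"])
    schedule = []
    for k in range(td["TDNUMRPT"]):
        previous = start + k * target  # previous planned assessment (assumption)
        schedule.append((previous + target, previous + lower, previous + upper))
    return schedule


# Hypothetical TD record: assessments every 8 weeks, allowed 53 to 59 days apart.
td = {"TDORDER": 1, "TDANCVAR": "ANCH1DT", "TDSTOFF": "P0D",
      "TDTGTPAI": "P8W", "TDMINPAI": "P53D", "TDMAXPAI": "P59D", "TDNUMRPT": 3}

for target, earliest, latest in planned_assessments(date(2013, 1, 1), td):
    print(target, earliest, latest)
```

In practice the anchor date would be read from the variable named in TDANCVAR rather than passed in directly.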

4 Representing Relationships among Datasets and Records¶

There are many occasions when it is necessary or desirable to represent relationships among datasets or records. The SDTM identifies eight distinct types of relationships:

•A relationship between a group of records for a given subject within the same dataset.

•A relationship between independent records (usually in separate datasets) for a subject, such as a concomitant medication taken to treat an adverse event.

•A relationship between two (or more) datasets where records of one (or more) dataset(s) are related to record(s) in another dataset (or datasets).

•A dependent relationship where data that cannot be represented by a standard variable within a general-observation-class dataset record (or records) can be related back to that record.

•A dependent relationship between a comment in the Comments domain and a parent record (or records) in other datasets, such as a comment recorded in association with an adverse event.

•A relationship between a subject and a pool of subjects.

•A relationship between a subject and associated person(s). (See section 2.4)

•A relationship between subjects in a study who are not part of a pool.

The implementation guides define specific details and examples for each of these relationships.

4.1 Datasets for Representing Relationships¶

4.1.1 Related Records Dataset¶

Table 4.1.1 RELREC Dataset

Variable Name	Variable Label	Type	Description
STUDYID	Study Identifier	Char	Study Identifier of the domain record(s).
RDOMAIN	Related Domain Abbreviation	Char	Two-character abbreviation for the domain of the parent record(s).
USUBJID	Unique Subject Identifier	Char	Identifier used to uniquely identify a subject across all studies for all applications or submissions involving the product.
APID	Associated Persons Identifier	Char	Identifier for a single associated person, a group of associated persons, or a pool of associated persons. If APID identifies a pool, POOLDEF records must exist for each associated person. (See Section 5 for Associated Persons data)
POOLID	Pool Identifier	Char	Identifier used for pooling subjects to assign a single finding to multiple subjects.
IDVAR	Identifying Variable	Char	Name of the identifying variable in the general-observation-class dataset that identifies the related record(s). Examples include –SEQ and –GRPID.
IDVARVAL	Identifying Variable Value	Char	Value of identifying variable described in IDVAR. If –SEQ is the variable being used to describe this record, then the value of –SEQ would be entered here.
RELTYPE	Relationship Type	Char	Identifies the hierarchical level of the records in the relationship. Values should be either ONE or MANY. However, values are only necessary when identifying a relationship between datasets.
RELID	Relationship Identifier	Char	Unique value within USUBJID that identifies the relationship. All records for the same USUBJID that have the same RELID are considered "related/associated." RELID can be any value the sponsor chooses, and is only meaningful within the RELREC dataset to identify the related/associated Domain records.
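
The role of RELID can be illustrated by grouping RELREC records on USUBJID and RELID, since all records for the same USUBJID that share a RELID are considered related. The sketch below links an adverse event to a concomitant medication; the function name related_groups, the subject identifier, and the sequence values are hypothetical.

```python
from collections import defaultdict


def related_groups(relrec):
    """Group RELREC records by (USUBJID, RELID)."""
    groups = defaultdict(list)
    for rec in relrec:
        key = (rec["USUBJID"], rec["RELID"])
        groups[key].append((rec["RDOMAIN"], rec["IDVAR"], rec["IDVARVAL"]))
    return dict(groups)


# Hypothetical records: AE record 5 and CM record 11 belong to one relationship.
relrec = [
    {"STUDYID": "XYZ", "RDOMAIN": "AE", "USUBJID": "XYZ-001",
     "IDVAR": "AESEQ", "IDVARVAL": "5", "RELTYPE": "", "RELID": "1"},
    {"STUDYID": "XYZ", "RDOMAIN": "CM", "USUBJID": "XYZ-001",
     "IDVAR": "CMSEQ", "IDVARVAL": "11", "RELTYPE": "", "RELID": "1"},
]

print(related_groups(relrec))
```

RELTYPE is left empty here because the records relate individual records rather than whole datasets.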

4.1.2 Supplemental Qualifiers Dataset¶

Table 4.1.2 SUPPQUAL Dataset

Variable Name	Variable Label	Type	Description
STUDYID	Study Identifier	Char	Study Identifier of the parent record(s).
RDOMAIN	Related Domain Abbreviation	Char	Two-character abbreviation for the domain of the parent record(s).
USUBJID	Unique Subject Identifier	Char	Identifier used to uniquely identify a subject across all studies for all applications or submissions involving the product.
APID	Associated Persons Identifier	Char	Identifier for a single associated person, a group of associated persons, or a pool of associated persons. If APID identifies a pool, POOLDEF records must exist for each associated person. (See Section 5 for Associated Persons data)
POOLID	Pool Identifier	Char	Identifier used for pooling subjects to assign a single finding to multiple subjects.
IDVAR	Identifying Variable	Char	Identifying variable in the parent dataset that identifies the related record(s). Examples: –SEQ, –GRPID.
IDVARVAL	Identifying Variable Value	Char	Value of identifying variable of the parent record(s).
QNAM	Qualifier Variable Name	Char	The short name of the Qualifier variable, which is used as a column name in a domain view with data from the parent domain. The value in QNAM cannot be longer than 8 characters, nor can it start with a number (e.g., "1TEST"). QNAM cannot contain characters other than letters, numbers, or underscores. This will often be the column name in the sponsor's operational dataset.
QLABEL	Qualifier Variable Label	Char	This is the long name or label associated with QNAM. The value in QLABEL cannot be longer than 40 characters. This will often be the column label in the sponsor's original dataset.
QVAL	Data Value	Char	Result of, response to, or value associated with QNAM. A value for this column is required; no records can be in a SUPP– dataset with a null value for QVAL.
QORIG	Origin	Char	Since QVAL can represent a mixture of collected (on a CRF), derived, or assigned items, QORIG is used to indicate the origin of this data. Examples include CRF, ASSIGNED, or DERIVED.
QEVAL	Evaluator	Char	Used only for results that are subjective (e.g., assigned by a person or a group). Should be null for records that contain objectively collected or derived data. Some examples include ADJUDICATION COMMITTEE, STATISTICIAN, DATABASE ADMINISTRATOR, CLINICAL COORDINATOR, etc.
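
Because QNAM is meant to serve as a column name in a view that combines supplemental data with the parent domain, a consumer of SUPPQUAL typically joins each record back to its parent using RDOMAIN, USUBJID, IDVAR, and IDVARVAL. The sketch below shows one way to do this with plain dictionaries. The function name attach_supplemental and the sample rows are hypothetical, and the identifying value is compared as text because IDVARVAL is a character variable.

```python
def attach_supplemental(parent_rows, supp_rows, domain):
    """Return copies of parent rows with matching QNAM/QVAL pairs added as columns."""
    merged = [dict(row) for row in parent_rows]  # leave the inputs untouched
    for supp in supp_rows:
        if supp["RDOMAIN"] != domain:
            continue
        for row in merged:
            same_subject = row["USUBJID"] == supp["USUBJID"]
            same_record = str(row.get(supp["IDVAR"])) == supp["IDVARVAL"]
            if same_subject and same_record:
                row[supp["QNAM"]] = supp["QVAL"]
    return merged


# Hypothetical parent (AE) rows and one supplemental qualifier.
ae = [
    {"USUBJID": "XYZ-001", "AESEQ": 1, "AETERM": "HEADACHE"},
    {"USUBJID": "XYZ-001", "AESEQ": 2, "AETERM": "NAUSEA"},
]
suppae = [
    {"STUDYID": "XYZ", "RDOMAIN": "AE", "USUBJID": "XYZ-001", "IDVAR": "AESEQ",
     "IDVARVAL": "2", "QNAM": "AETRTEM", "QLABEL": "Treatment Emergent Flag",
     "QVAL": "Y", "QORIG": "DERIVED", "QEVAL": ""},
]

for row in attach_supplemental(ae, suppae, "AE"):
    print(row)
```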

4.1.3 Pool Definition Dataset¶

This dataset identifies individual subjects included in a pool of subjects for which a single observation record (pool level) is captured.

Table 4.1.3 POOLDEF Dataset

Variable Name	Variable Label	Type	Description
STUDYID	Study Identifier	Char	Study Identifier of the parent record(s).
POOLID	Pool Identifier	Char	Identifier used for pooling subjects to assign a single finding to multiple subjects.
USUBJID	Unique Subject Identifier	Char	Identifier used to uniquely identify a subject across all studies for all applications or submissions involving the product.
APID	Associated Persons Identifier	Char	Identifier for a single associated person, a group of associated persons, or a pool of associated persons.

4.1.4 Related Subjects Dataset¶

Some studies include subjects who are related to each other, and in some cases it is important to record those relationships. Studies in which pregnant women are treated and both the mother and her child(ren) are study subjects are the most common case in which relationships between subjects are collected. There are also studies of genetically based diseases where subjects who are related to each other are enrolled, and the relationships between subjects are recorded.

Table 4.1.4 RELSUB Dataset

Variable Name	Variable Label	Type	CDISC Notes
STUDYID	Study Identifier	Char	Unique identifier for a study.
USUBJID	Unique Subject Identifier	Char	Identifier used to uniquely identify a subject across all studies for all applications or submissions involving the product. Either USUBJID or POOLID must be populated.
POOLID	Pool Identifier	Char	Identifier used to identify a pool of subjects. If POOLID is entered, POOLDEF records must exist for each subject in the pool and USUBJID must be null. Either USUBJID or POOLID must be populated.
RSUBJID	Related Subject or Pool Identifier	Char	Identifier used to identify a related subject or pool of subjects. RSUBJID will be populated with either the USUBJID of the related subject or the POOLID of the related pool.
SREL	Subject Relationship	Char	Describes the relationship of the subject identified in USUBJID or the pool identified in POOLID to the subject or pool identified in RSUBJID.

5 Applying Model Fundamentals to Associated Persons¶

5.1 Creating Associated Persons Domains¶

Associated Persons (AP) are persons who can be associated with a study, a particular study subject or a device used in the study. An AP may or may not have a familial relationship to a study subject. AP domains are created using SDTM variables, with the application of specific AP rules, including:

•Implementers creating AP domains will follow the AP assumptions for the Identifier variables.

•AP will be the prefix for the domain and dataset name, and will identify the dataset as AP data.

•APID will be required in all AP datasets, and will identify records in a data warehouse as AP data.

The Study Data Tabulation Model Associated Persons Implementation Guide (SDTMIG-AP) provides implementation rules and advice. Unless an exception is described in this implementation guide, all other general assumptions about SDTM and SDTMIG variables and domains will apply to AP data.

5.1.1 Variables Used in Associated Persons Data¶

Table 5.1.1 Associated Persons Data – Identifier Variables

Variable Name	Variable Label	Type	CDISC Notes
APID	Associated Persons Identifier	Char	Identifier for a single associated person, a group of associated persons, or a pool of associated persons. If APID identifies a pool, POOLDEF records must exist for each associated person.
RSUBJID	Related Subject	Char	Identifier for a related study subject or pool of study subjects. The subject(s) may be human or animal. RSUBJID will be populated with the USUBJID of the related subject or the POOLID of the related pool. RSUBJID will be null for data about associated persons who are related to the study but not to any of the study subjects.
RDEVID	Related Device	Char	Identifier for a related device. RDEVID will be populated with the SPDEVID of the related device.
SREL	Subject, Device, or Study Relationship	Char	If RSUBJID is populated, describes the relationship of the associated person(s) identified in APID to the subject or pool identified in RSUBJID. If RDEVID is populated, describes the relationship of the associated person(s) identified in APID to the device identified in RDEVID. If RSUBJID and RDEVID are null, SREL describes the relationship of the associated person(s) identified in APID to the study identified in STUDYID.

5.2 Associated Person Relationships¶

5.2.1 Relationships between Associated Persons and Subjects¶

AP domains include relationship information because it is understood that a relationship must exist for a non-subject to be considered an AP. However, in cases where an AP has relationships to multiple subjects or devices and/or multiple relationships to a single subject or device, a single value in SREL is inadequate to describe these multiple relationships. In those cases, the value MULTIPLE should appear in SREL. If an AP has relationships with multiple subjects, MULTIPLE may also appear in RSUBJID. When other SDTM variables are populated with MULTIPLE, the multiple values are stored in Supplemental Qualifiers. However, this was found to be an indirect and cumbersome way to handle multiple relationships of an AP to subject(s). In addition, if an AP had data in multiple domains, the Supplemental Qualifier approach would require the same set of Supplemental Qualifiers to be repeated for each domain. The APRELSUB table, which parallels the structure of the RELSUB table, was created as a more efficient and simpler way to record these multiple relationships. The APRELSUB table is required for studies in which SREL values of MULTIPLE appear, but would not be needed if each AP has only one relationship to one subject.

Table 5.2.1 APRELSUB Dataset

Variable Name	Variable Label	Type	Role	CDISC Notes
STUDYID	Study Identifier	Char	Identifier	Unique identifier for a study.
APID	Associated Persons Identifier	Char	Identifier	Identifier for a single associated person, a group of associated persons, or a pool of associated persons. If APID identifies a pool, POOLDEF records must exist for each associated person.
RSUBJID	Related Subject or Pool Identifier	Char	Identifier	Identifier for a related subject or pool of subjects. RSUBJID may be populated with the USUBJID of the related subject or the POOLID of the related pool. RSUBJID will be null for data about associated persons who are related to the study but not to any study subjects.
RDEVID	Related Device Identifier	Char	Identifier	Identifier for a related device. RDEVID will be populated with the SPDEVID of the related device.
SREL	Subject, Device, or Study Relationship	Char	Record Qualifier	If RSUBJID is populated, describes the relationship of the associated person(s) identified in APID to the subject or pool identified in RSUBJID. If RDEVID is populated, describes the relationship of the associated person(s) identified in APID to the device identified in RDEVID. If RSUBJID and RDEVID are null, SREL describes the relationship of the associated person(s) identified in APID to the study identified in STUDYID.

6 Using the Model for Regulatory Submissions¶

The SDTM has been designed to accommodate the broadest range of human and animal study data in a standardized manner. This document describes the basic concepts and general structures of the model. Individual implementation guides have been created to provide specific recommendations for numerous domains of data commonly collected in human, animal and medical device studies, identifying which variables from a general observation class may apply. These implementation guides also describe basic assumptions and business rules, and provide numerous examples for mapping data to the standard format. Any sponsor wishing to submit data in the standard formats should first consult the implementation guides before preparing a regulatory submission based on the SDTM. The following implementation guides have been published by CDISC:

1.The Study Data Tabulation Model Implementation Guide for Human Clinical Trials (SDTMIG)

2.The Standard for Exchange of Non-Clinical Data Implementation Guide (SENDIG)

3.The Study Data Tabulation Model Implementation Guide for Medical Devices (SDTMIG-MD)

4.The Study Data Tabulation Model Associated Persons Implementation Guide (SDTMIG-AP)

7 SDTM Version History¶

Version 1.4 represents the fifth formal release of the Study Data Tabulation Model. The original version was released as the Study Data Tabulation Model Version 1.0 in June 2004. The last prior version was issued in July 2012.

7.1 Changes from SDTM V1.3 to SDTM V1.4¶

7.1.1 Additions¶

Table 2.2.1 - Interventions:

•–MOOD Mood

•–LAT Laterality

•–DIR Directionality

•–PORTOT Portion or Totality

•–FAST Fasting Status

•–PSTRG Pharmaceutical Strength

•–PSTRGU Pharmaceutical Strength Unit

Table 2.2.2 - Events:

•–LAT Laterality

•–DIR Directionality

•–PORTOT Portion or Totality

•–PARTY Accountable Party

•–PRTYID Identification of Accountable Party

•–ACNDEV Action Taken with Device

Table 2.2.3 - Findings:

•–TSTDTL Measurement, Test or Examination Detail

•–RUNID Run ID

•–ANMETH Method of Analysis

•–ULOQ Upper Limit of Quantitation

Table 2.2.4 - Identifiers:

•SPDEVID Sponsor Device Identifier

Table 2.2.5 - Timing Variables:

•–EVINTX Evaluation Interval Text

•–STINT Planned Start of Assessment Interval

•–ENINT Planned End of Assessment Interval

Table 3.5.1 - Trial Disease Assessments: (New Table)

•STUDYID Study Identifier

•DOMAIN Domain Abbreviation

•TDORDER Sequence of Planned Assessment Schedule

•TDANCVAR Anchor Variable Name

•TDSTOFF Offset from the Anchor

•TDTGTPAI Planned Assessment Interval

•TDMINPAI Planned Assessment Interval Minimum

•TDMAXPAI Planned Assessment Interval Maximum

•TDNUMRPT Maximum Number of Actual Assessments

Table 4.1.1 – RELREC Dataset:

•APID Associated Persons Identifier

•POOLID Pool Identifier

Table 4.1.2 – SUPPQUAL Dataset:

•APID Associated Persons Identifier

•POOLID Pool Identifier

Table 4.1.3 – POOLDEF Dataset:

•APID Associated Persons Identifier

Table 4.1.4 – RELSUB Dataset: (New Table)

•STUDYID Study Identifier

•USUBJID Unique Subject Identifier

•POOLID Pool Identifier

•RSUBJID Related Subject

•SREL Subject, Device, or Study Relationship

Table 5.1.1 – Associated Persons – Identifier variables: (New Table)

•APID Associated Persons Identifier

•RSUBJID Related Subject

•RDEVID Related Device

•SREL Subject, Device, or Study Relationship

Table 5.2.1 – APRELSUB Dataset: (New Table)

•STUDYID Study Identifier

•APID Associated Persons Identifier

•RSUBJID Related Subject

•RDEVID Related Device

•SREL Subject, Device, or Study Relationship

7.2 Variables to be deprecated¶

7.2.1 Proposed deletions post SDTM V1.4¶

Table 2.2.5 – Timing variable for all classes:

•–STRF: The functionality of this variable can be replaced by the use of –STRTPT with –STTPT = RFSTDTC.

•–ENRF: The functionality of this variable can be replaced by the use of –ENRTPT with –ENTPT = RFENDTC.

Appendices¶

Appendix A: Representations and Warranties, Limitations of Liability, and Disclaimers¶

CDISC Patent Disclaimers¶

It is possible that implementation of and compliance with this standard may require use of subject matter covered by patent rights. By publication of this standard, no position is taken with respect to the existence or validity of any claim or of any patent rights in connection therewith. CDISC, including the CDISC Board of Directors, shall not be responsible for identifying patent claims for which a license may be required in order to implement this standard or for conducting inquiries into the legal validity or scope of those patents or patent claims that are brought to its attention.

Representations and Warranties¶

Each Participant in the development of this standard shall be deemed to represent, warrant, and covenant, at the time of a Contribution by such Participant (or by its Representative), that to the best of its knowledge and ability: (a) it holds or has the right to grant all relevant licenses to any of its Contributions in all jurisdictions or territories in which it holds relevant intellectual property rights; (b) there are no limits to the Participant's ability to make the grants, acknowledgments, and agreements herein; and (c) the Contribution does not subject any Contribution, Draft Standard, Final Standard, or implementations thereof, in whole or in part, to licensing obligations with additional restrictions or requirements inconsistent with those set forth in this Policy, or that would require any such Contribution, Final Standard, or implementation, in whole or in part, to be either: (i) disclosed or distributed in source code form; (ii) licensed for the purpose of making derivative works (other than as set forth in Section 4.2 of the CDISC Intellectual Property Policy ("the Policy")); or (iii) distributed at no charge, except as set forth in Sections 3, 5.1, and 4.2 of the Policy. If a Participant has knowledge that a Contribution made by any Participant or any other party may subject any Contribution, Draft Standard, Final Standard, or implementation, in whole or in part, to one or more of the licensing obligations listed in Section 9.3, such Participant shall give prompt notice of the same to the CDISC President who shall promptly notify all Participants.

No Other Warranties/Disclaimers. ALL PARTICIPANTS ACKNOWLEDGE THAT, EXCEPT AS PROVIDED UNDER SECTION 9.3 OF THE CDISC INTELLECTUAL PROPERTY POLICY, ALL DRAFT STANDARDS AND FINAL STANDARDS, AND ALL CONTRIBUTIONS TO FINAL STANDARDS AND DRAFT STANDARDS, ARE PROVIDED "AS IS" WITH NO WARRANTIES WHATSOEVER, WHETHER EXPRESS, IMPLIED, STATUTORY, OR OTHERWISE, AND THE PARTICIPANTS, REPRESENTATIVES, THE CDISC PRESIDENT, THE CDISC BOARD OF DIRECTORS, AND CDISC EXPRESSLY DISCLAIM ANY WARRANTY OF MERCHANTABILITY, NONINFRINGEMENT, FITNESS FOR ANY PARTICULAR OR INTENDED PURPOSE, OR ANY OTHER WARRANTY OTHERWISE ARISING OUT OF ANY PROPOSAL, FINAL STANDARDS OR DRAFT STANDARDS, OR CONTRIBUTION.

Limitation of Liability¶

IN NO EVENT WILL CDISC OR ANY OF ITS CONSTITUENT PARTS (INCLUDING, BUT NOT LIMITED TO, THE CDISC BOARD OF DIRECTORS, THE CDISC PRESIDENT, CDISC STAFF, AND CDISC MEMBERS) BE LIABLE TO ANY OTHER PERSON OR ENTITY FOR ANY LOSS OF PROFITS, LOSS OF USE, DIRECT, INDIRECT, INCIDENTAL, CONSEQUENTIAL, OR SPECIAL DAMAGES, WHETHER UNDER CONTRACT, TORT, WARRANTY, OR OTHERWISE, ARISING IN ANY WAY OUT OF THIS POLICY OR ANY RELATED AGREEMENT, WHETHER OR NOT SUCH PARTY HAD ADVANCE NOTICE OF THE POSSIBILITY OF SUCH DAMAGES.

Note: The CDISC Intellectual Property Policy can be found at:

http://www.cdisc.org/bylaws-and-policies
