Coding Guide
↗ Trial Warehouse Introduction | ↗ Codebook
The data warehouse has its own set of rules determining how questionnaires and items should be named in the dataset. This ensures that variables measuring the same question or construct have an identical name across trials. This is important to allow for easy and error-free data merges. For variables with factor levels (e.g., sex), it is also essential to follow the coding guidelines to ensure that codes are consistent.
Consulting the coding guide is important when you:
inc
) is coded as a factor with specific levels in the data warehouse, and you may either assess income as the exact number, or use the same factor levels as are used in the data warehouse.To access the documentation, you can either use the protectr
R package, or the Airtable Graphical User Interface.
.
assessment_point.i
item_number.0
), to post-test (1
), follow-up (2
), follow-up 2 (3
), and so forth.0_s
.0_1
, 0_2
, 0_3
.sex
, not as sex.0.i1
. The number of completed sessions is only “assessed” at post-test/follow-up, and therefore also has no assessment time and item code (sess
).zuf
(CSQ-8). This questionnaire is conventionally used at post-test. Therefore, the assessment point code is dropped, e.g. item 1 is coded as zuf.i1
, the sum score as zuf
.bfi
(BFI-10). This questionnaire assessed personality, and is therefore conventionally used at baseline only. The questionnaire has 5 subscales corresponding with the “Big Five” of personality assessment. In some trials, sum scores for these subscales are provided, for which the assessment point is dropped (e.g., bfi.a
for agreeableness). However, for the item-level data, the conventional coding style is used (e.g. bfi.0.i1
).cesd.1
for the CES-D sum score at post-test). Same is also true for questionnaires consisting of only one item which are assessed at several timepoints.eri.r.0.i1
).0
and “yes” as 1
.IMPORTANT
Although we aim to conform as strictly as possible to the coding guidelines, there can still be cases where exceptions have to be made. The best approach is to directly download a dataset containing the respective questionnaire to see how the coding was done. If you have questions concerning the coding, you can also contact Mathias (mathias.harrer@fau.de).
If you want to have your data stored in the warehouse, please send it to Mathias (mathias.harrer@fau.de), preferrably via secure options such as FAUBox. We will then upload the tidy dataset to the warehouse, where it can be accessed by functions of the protectr
package. It is also possible to store the “raw”, uncleaned data in the data warehouse only for you.