Issue title: Machine Learning in Applied Statistics
Guest editors: Jong-Min Kim
Article type: Research Article
Authors: Choi, Yoonha | Babiarz, Joshua | Tom, Ed | Kennedy, Giulia C. | Huang, Jing*
Affiliations: Veracyte Inc., South San Francisco, CA, USA
Correspondence: [*] Corresponding author: Jing Huang, Veracyte Inc., 6000 Shoreline Ct #300, South San Francisco, CA 94080, USA. E-mail: [email protected].
Abstract: BACKGROUND AND OBJECTIVES: Kinship coefficients measure the relatedness between two individuals and are widely used in genetic applications. In this study, we repurpose the kinship coefficient to facilitate sample tracking directly and to identify potential sample swaps. Such sample integrity metrics are particularly important in two scenarios in large-scale clinical studies. In the first, multiple biological samples from the same individual are routinely processed as unique samples or technical replicates; querying the relatedness of the genomic data from two samples can identify sample swaps before they are inappropriately included in data analysis. In the second, different biological analytes from the same samples are run across multiple platforms, and it is critical to establish the correct mapping for each individual sample, linking genomic information derived from multiple platforms to the same sample. In both cases, all downstream inferences rely on this mapping being correct. Kinship coefficients can directly measure the mapping accuracy and ensure the required sample integrity.
MATERIALS AND METHODS: We first describe the general concept of kinship coefficients and then focus on novel adaptations of feature (i.e. variants and/or SNPs) selection that use expressed variants to make the method suitable for the clinical setting.
RESULTS: We illustrate the adapted kinship coefficient estimates in two studies: one on lung fibrosis, where multiple samples were routinely collected from each patient, and one on thyroid cancers, where a cohort of samples was run on different platforms.
CONCLUSION: We demonstrate the effectiveness of using kinship coefficients to improve sample integrity and discuss potential improvements in the methodology.
Keywords: Kinship, sample integrity, clinical, next generation sequencing
DOI: 10.3233/MAS-170401
Journal: Model Assisted Statistics and Applications, vol. 12, no. 3, pp. 265-273, 2017
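
The sample-swap check described in the abstract can be sketched roughly as follows. The abstract does not state which kinship estimator the authors use, so this sketch assumes a simple identity-by-state estimator of the KING type, (N_het,het - 2 N_opposite-hom) / (N_het,1 + N_het,2), applied to genotypes coded as 0, 1, or 2 copies of the alternate allele. The function names `kinship` and `flag_swaps`, the input layout, and the 0.354 duplicate cutoff are illustrative assumptions, not taken from the paper.

```python
from itertools import combinations


def kinship(g1, g2):
    """Estimate kinship from two genotype vectors coded 0/1/2 (None = missing).

    Assumed estimator (not necessarily the authors'):
    (N_het,het - 2 * N_opposite_hom) / (N_het_1 + N_het_2).
    Identical genotype vectors give 0.5; unrelated samples fall near or below 0.
    """
    het1 = het2 = het_both = opp_hom = 0
    for a, b in zip(g1, g2):
        if a is None or b is None:
            continue  # skip variants not called in both samples
        if a == 1:
            het1 += 1
        if b == 1:
            het2 += 1
        if a == 1 and b == 1:
            het_both += 1
        if abs(a - b) == 2:
            opp_hom += 1  # opposite homozygotes (0 vs 2)
    denom = het1 + het2
    if denom == 0:
        return float("nan")  # no heterozygous calls, estimate undefined
    return (het_both - 2 * opp_hom) / denom


def flag_swaps(genotypes, patient_of, duplicate_cutoff=0.354):
    """Compare every sample pair against its recorded patient labels.

    genotypes: dict of sample id -> genotype vector (hypothetical layout)
    patient_of: dict of sample id -> recorded patient id
    duplicate_cutoff: assumed threshold above which two samples are
        treated as coming from the same individual
    """
    flags = []
    for s1, s2 in combinations(sorted(genotypes), 2):
        k = kinship(genotypes[s1], genotypes[s2])
        same_label = patient_of[s1] == patient_of[s2]
        looks_same = k > duplicate_cutoff
        if same_label and not looks_same:
            flags.append((s1, s2, k, "expected match missing"))
        elif looks_same and not same_label:
            flags.append((s1, s2, k, "unexpected match"))
    return flags
```

A pair recorded as the same patient but scoring below the cutoff, or a pair recorded as different patients but scoring above it, is a candidate swap for manual review. In practice the cutoff would need tuning to the variant set and platform, particularly when variants are called from expression data.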