子集Plink二进制文件(床,BIM& fam)
大家好,我正在研究基因型数据,我有GWAS的摘要统计信息,我有床,BIM和FAM文件。因为个人的数量很多,所以我想从我的二进制文件中以3000个随机的数量进行采样。换句话说,我想将二进制文件子集。您知道如何通过Plink,R或Python做到这一点吗?
Hi everyone I am working on genotype data, I have bed, bim and fam files with the summary statistics of GWAS. Because the number of individuals are a lot, so I want to sample from my binary files in numbers of 3000 randomly . In other words, I want to subset the binary files. Do you know how can I do that by plink, R or python?
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。

绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
您可以使用Plink实现这一目标。首先,创建要子集的个人列表,并将其命名为
sysys.txt
。接下来,运行以下命令,为sysys.txt
plink -bfile toy -emepy nyses.txt -make-bed-out toy_subset
plink plink> plink plink plink plink /code>希望这会有所帮助。
you can achieve that using PLINK. First, create a list of individuals that you want to subset and name it say
individuals.txt
. Next, run the following command to create a separate binary file for individuals in theindividuals.txt
plink --bfile toy --keep individuals.txt --make-bed --out toy_subset
Hope this helps.