• Log in with Facebook Log in with Twitter Log In with Google      Sign In    
  • Create Account
  LongeCity
              Advocacy & Research for Unlimited Lifespans

Photo
- - - - -

Impute@Home: Anyone interested in a 40 million SNP Imputation from 23andme results?

imputation genetics impute2 snp

  • Please log in to reply
29 replies to this topic

#1 mag1

  • Guest
  • 1,061 posts
  • 134
  • Location:virtual

Posted 27 December 2014 - 08:32 PM


 

 

I started a thread "Anyone interested in a 40 million SNP imputation from 23andme?". People did not seem that interested.
It was suggested that such a project should be in the Project Ideas Forum.

OK, Anyone interested in doing some imputation? You could take your 23andme results and impute 40 million SNPs.
It is not really that technically complex, though there are a few issues surrounding concordance that we could work thourgh.

Anyone want to help work through this?

First, we could get up and running with Impute2. It is not that difficult to get the program running. I posted the recipe on the other thread.
The first snag is understanding and hopefully correcting concordance issues.

Second, we might set up a impute@home project. Such a project could make a real contribution to humanity. The research community has done a massive
amount of imputation. If we set-up the infrastructure on this site, then they might off-load some of their work to us. I do not understand why this
has not been done before. I have been pushing this idea on several websites and I have been told to get lost.

I thought LongeCity would be a great site for this project, or should I just get lost?

 

 

Original discussion follows:

 

There is a free software program called Impute2 that can be used to impute tens of millions of SNPs from the 23andme chip.

Getting the program to run is not very difficult, though I have a few technical issues regarding the results of the program.

 

If anyone is interested, then we could collectively work through the technical issues. Doing so would provide those with gene chip results with a greatly expanded view of their genetics. Imputation is commonly used in GWAS studies.

 

Another issue that might need to be addressed is the huge download genome files that are now available for imputation. If these files were divided, then they would be much easier to manage. Such full genome imputation files might provide a very accurate

imputation result.

 

Anyone interested?

 

We might also want to set up an @HOME project for imputation. There is overwhelming demand for imputation. impute@HOME

could be a significant contribution to humanity.


Edited by YOLF, 13 March 2015 - 12:36 AM.


#2 mag1

  • Topic Starter
  • Guest
  • 1,061 posts
  • 134
  • Location:virtual

Posted 28 December 2014 - 08:39 PM

For anyone interested in imputing their genome, this should be helpful.

 

-Download Plink 1.9 at https://www.cog-genomics.org/plink2/

Make a folder on the desktop, and put Plink 1.9 into it.

 

-Open a command window. Set the directory path using commands cd and cd.. to the folder with Plink in it. The command cd.. brings you up one directory. For example if current directory is C:\Users\A\Desktop> and the folder is named plink, then type cd Plink. [You need to open the DOS command prompt to do this. The DOS command prompt can be opened in Windows 8 by swiping the top right corner, typing "command prompt" in the search box, pressing enter, and clicking on the command prompt icon that should be displayed.]

-Read in your 23andme files with below commands. (Impute2 requires at least 2 people in the g file to function):

plink --23file genome.txt AA --out plink_genome1 plink --23file genome.txt BB --out plink_genome2

-Merge the 23andme files with below command:

plink --bfile plink_genome1 --bmerge plink_genome2 --make-bed --out combogenome1_trial

-Recode the Plink file to Impute2 format with below command:

plink --bfile combogenome1_trial --recode oxford --out combogenome1ox

-Download the Impute2 software from http://mathgen.stats.../impute_v2.html make a folder on the desktop and put the  Impute2 download into it.

 

-Set the directory path to the folder where you put the Impute2 software using the command prompt commands cd.. and cd

 

-Run the examples using the commands given on the Impute2 website.

 

-Download the 1000 Genomes reference files from the Impute2 site and put it into the folder with Impute2. (These files are huge {about 3GB}.)

 

-Unzip them and put them into the folder with Impute2.

 

-Run the below command to impute your 23andme file. (change the -int argument ranges to move along a chromosome; in the -m argument change the chromosome by changing "21" to the desired chromosome; and with the -o argument change the chromosome number to the desired chromosome and change the number in "Impute1" in order to have a record of your results.) {You might also need to change the name of the "ALL" files as the file names might be too long.}

 

impute2 -m genetic_map_chr21_combined_b37.txt -h ALL.chr21.integrated_phase1_v3.20101123.snps_indels_svs.genotypes.nomono.haplotypes -l ALL.chr21.integrated_phase1_v3.20101123.snps_indels_svs.genotypes.nomono.legend -g combogenome1ox -int 40.0e6 42.0e6 -Ne 20000 -o Impute1.chr21

 

-Check the concordance tables at the end of each run, the values should be in the 90+%. It is sometimes difficult getting high concordances. One thing to help with this is to change the composition of the g file.

-If this all works out, then you would be able to impute millions of SNPs from your 23andme file! ( I got almost 40 million from my 23andme file.) The results for SNPs with non reference allele frequencies above 10% have aggregate r2s in the mid .90s. ( http://mathgen.stats...T2_9-12-13.html )

 

This means that for most common SNPs you should receive an accurately imputed genotype. This would be very useful if you wanted to estimate your risk of a disease with many common variants.

 

Others on forum might be able to help out to make sure that the imputed genotypes are accurate. I am concerned about some of the low concordance results.



#3 treonsverdery

  • Guest
  • 1,312 posts
  • 161
  • Location:where I am at

Posted 26 February 2015 - 11:44 PM

although I do not have a 23andme genome description I like your software activity

perhaps this will refresh the participation opportunity among other viewers

 



#4 YOLF

  • Location:Delaware Delawhere, Delahere, Delathere!

Posted 12 March 2015 - 10:08 PM

Great idea! Though this really belongs in the project ideas forum. I would like to see this become a LongeCity project.

 

Also, what's the possibility of getting a 23andme chip for home use? FWIU it's just a USB thumbdrive like device? Could people use this for full genome sequencing?


  • Enjoying the show x 1

#5 mag1

  • Topic Starter
  • Guest
  • 1,061 posts
  • 134
  • Location:virtual

Posted 12 March 2015 - 10:31 PM

I started a thread "Anyone interested in a 40 million SNP imputation from 23andme?". People did not seem that interested.
It was suggested that such a project should be in the Project Ideas Forum.

OK, Anyone interested in doing some imputation? You could take your 23andme results and impute 40 million SNPs.
It is not really that technically complex, though there are a few issues surrounding concordance that we could work thourgh.

Anyone want to help work through this?

First, we could get up and running with Impute2. It is not that difficult to get the program running. I posted the recipe on the other thread.
The first snag is understanding and hopefully correcting concordance issues.

Second, we might set up a impute@home project. Such a project could make a real contribution to humanity. The research community has done a massive
amount of imputation. If we set-up the infrastructure on this site, then they might off-load some of their work to us. I do not understand why this
has not been done before. I have been pushing this idea on several websites and I have been told to get lost.

I thought LongeCity would be a great site for this project, or should I just get lost?

#6 mag1

  • Topic Starter
  • Guest
  • 1,061 posts
  • 134
  • Location:virtual

Posted 13 March 2015 - 12:32 AM

Might a moderator copy and paste the comments from the thread "Anyone interested in a 40 million SNP imputation from 23andme?" into this thread?

#7 niner

  • Guest
  • 16,276 posts
  • 2,000
  • Location:Philadelphia

Posted 13 March 2015 - 01:23 PM

It would probably be helpful to explain what imputation is, in the context of genetics, for people who are not familiar with it.


  • Agree x 2

#8 RobbieG

  • Guest
  • 40 posts
  • 3
  • Location:Atlanta
  • NO

Posted 14 March 2015 - 03:14 PM

Mag1. I just ordered the kit from 23andme and would be interested in anything that can help interpret the results. However i am confused. Aren't there already free websites we can go to that provide this snp imputation results? Am i missing something?

#9 mag1

  • Topic Starter
  • Guest
  • 1,061 posts
  • 134
  • Location:virtual

Posted 14 March 2015 - 04:05 PM

Imputation is the use of statistics to predict genotypes given a set of SNPs.

http://www.illumina...._imputation.pdf


Simply running a million SNP gene chip provides much of the information content for the remaining
3 billion genetic letters.

If full genome scans (with phasing) were done on the parents of a child, it might not be necessary to
sequence the child. The child's genome could be known to very high accuracy by genotyping with a gene chip
(phased) and then filling in the genome with the parents' sequence. The only uncertainty using this approach
would be at the crossover regions. A typical chromosome has 1 chiasma (this is where the mother's and father's
homologous chromosomes switch over). The gene chip would help locate the location of these cross-overs. If the
cross-over points could be precisely found, the child would have an almost perfect readout on their genome.
The only differences that could arise would be from rare mutations, chromosomal rearrangements etc. .

Considering the large amount of full genome sequencing currently being done, the genomes of near relatives could
also be used to help determine sequences.

Using Impute2, I was able to impute 40 million SNPs from my 23andme results. Many of these appear to be high quality results.
It would be helpful if others on this forum went through the recipe that I posted so that we could make sure the results are
in fact accurate and work any problems that might be found.

#10 mag1

  • Topic Starter
  • Guest
  • 1,061 posts
  • 134
  • Location:virtual

Posted 14 March 2015 - 04:13 PM

I am not sure what (if any) imputation services exist on the web. I have looked around and I have not been able to find such sites.
What websites do imputations?

Imputation has been a critically important tool for the scientific community when trying to extract information from GWASs. It is very
unclear why no @home project for imputation is currently running. There is a massive demand for imputation. When the 1 million 23andme
customers realize that they might be able to impute almost their entire genome with reasonable accuracy (using a free program), they might
show some interest in our project here on this forum. It has been frustrating trying to move this idea forward. I contacted 23andme and they told
me that they would not do an imputation for me (even though they know exactly how to do this and likely do such imputations on customers data frequently).

#11 lassefolkersen

  • Guest
  • 2 posts
  • 2
  • Location:Denmark
  • NO

Posted 29 September 2015 - 05:52 PM

I just completed a prototype server to do just that. You can test it at:

www.impute.me

 

... I plan to add analysis modules from here on, now that I have so many more SNPs available. Help very welcome.

 


  • like x 1

#12 mag1

  • Topic Starter
  • Guest
  • 1,061 posts
  • 134
  • Location:virtual

Posted 29 September 2015 - 06:40 PM

Yeah Denmark!

 

I have been kicking around this idea for quite some time and got nowhere.

Quite a few of the people were downright rude and indignant when I suggested it on their sites.

It made no sense to me!

 

This imputation program could really help humanity!

I have absolutely no idea why this has not been done on the internet before.

I thought the idea was more to use the @home model.

This way the computational power of those on the internet could be used to get the job done.

 

For my purposes of getting a good imputation done it will not matter.

One thing that I have not been sure about is how one would handle the large datasets that have emerged such as the UK 10K.

 

Imputing with Impute2 on a desktop with 1000 Genomes was doable.

However, I am not sure whether it is possible anymore with these newer datasets.

 

It would be just so awesome if there were a downloadable program that would allow you to input a 23andme file, access a 1 million base portion of the genome 

in a large whole genome set such as UIK 10K, and then do an imputation for your file over this 1 million stretch. That way your desktop could still handle

the datafile.

 

 

Great job for this program!

Thank you very much!

(The damsel in distress strategy works)

 

Does your program allow uploading a VCF file from an exome scan?

Having information on the full 62 million base pairs on the exome would likely greatly increase the accuracy of any imputation.

 

 

 

 

 



#13 mag1

  • Topic Starter
  • Guest
  • 1,061 posts
  • 134
  • Location:virtual

Posted 29 September 2015 - 06:44 PM

I just tried to upload the 23andme file and it said that file size was exceeded.

Will the program work if I break the file up into parts?



#14 mag1

  • Topic Starter
  • Guest
  • 1,061 posts
  • 134
  • Location:virtual

Posted 29 September 2015 - 07:07 PM

I just uploaded a partial 23andme file that was successfully received by the website.

However, I forgot to change the email notification from your email to mine.

 

Sorry!

 

The imputation result will be sent to you and not me.



#15 Danail Bulgaria

  • Guest
  • 2,213 posts
  • 421
  • Location:Bulgaria

Posted 29 September 2015 - 07:18 PM

Sorry for the noobie question, but how exactly this chip is supposed to make us immortal? 



#16 niner

  • Guest
  • 16,276 posts
  • 2,000
  • Location:Philadelphia

Posted 29 September 2015 - 09:20 PM

Sorry for the noobie question, but how exactly this chip is supposed to make us immortal? 

 

Imputation isn't a chip, it's something you do with data from the chip.  It lets you use the SNPs from the chip to predict the identity of bases at other positions in the genome.  A better question would be "How does knowing our genome help make us immortal?"  The answer is that it might let you know that you need an unusual form of a vitamin, or that you would do much better with a particular macronutrient ratio, or that you are at risk for a particular condition.  These are all forms of actionable intelligence that might help you optimize your health.  Optimizing your health, aside from making you feel better, may help you to live long enough to still be alive when regenerative therapies become available.  It's a part of the 'live long enough to live "forever"' strategy.



#17 Danail Bulgaria

  • Guest
  • 2,213 posts
  • 421
  • Location:Bulgaria

Posted 29 September 2015 - 09:49 PM

This is not for me. I would rely on the current medicine for living long enough. Nice reading, though. 


  • Agree x 1

#18 mag1

  • Topic Starter
  • Guest
  • 1,061 posts
  • 134
  • Location:virtual

Posted 30 September 2015 - 01:07 AM

Probably best to disguise the fact that you are using Impute2 for the imputation.

{Impute2 is shown on the Kickstarter page.}

 

It would be a good idea not to note this fact.

Those who wrote Impute2 wanted recognition or payment if their program was used for a commercial purpose or otherwise.

 

All you need to do is omit reference to what software package is doing the imputing.

That should clear things up.



#19 niner

  • Guest
  • 16,276 posts
  • 2,000
  • Location:Philadelphia

Posted 30 September 2015 - 02:19 AM

This is not for me. I would rely on the current medicine for living long enough. Nice reading, though. 

 

But genomics is current medicine.  The only question is do you want to pay a couple thousand dollars for a test that your doctor orders that identifies one or two SNPs, or do you want to pay 99 dollars for a million SNPs?  The first way is called "modern medicine" and the second way is called 23andMe.


  • Cheerful x 1

#20 mag1

  • Topic Starter
  • Guest
  • 1,061 posts
  • 134
  • Location:virtual

Posted 30 September 2015 - 02:29 AM

Um, and of course given the nature of this thread one might be then able to take the 23andme result and impute 4 million high quality SNPs for free!



#21 niner

  • Guest
  • 16,276 posts
  • 2,000
  • Location:Philadelphia

Posted 30 September 2015 - 02:56 AM

Um, and of course given the nature of this thread one might be then able to take the 23andme result and impute 4 million high quality SNPs for free!

 

Yes, that's a good point.  One thing that makes me hesitant to go to the trouble of imputation is a concern that I wouldn't get a much (or any) actionable intelligence out of it.  The latest generation of chips presumably have the most interesting/useful SNPs; how many of the remaining / imputable SNPs would tell us something we could use?



#22 mag1

  • Topic Starter
  • Guest
  • 1,061 posts
  • 134
  • Location:virtual

Posted 30 September 2015 - 03:16 AM

You might be surprised! 23andme appears to have went to no effort in providing imputation for their customers even though it is a widely

used technique in GWAS studies. When 23andme decide on which genechip they will use there is no way for them to predict which SNPs will be

those which have disease relevance. They just have to sort of guess.

 

On another disease site it was found that 23andme did not even bother reporting SNPs that have 100% correlation with the SNPs on their chip.

A GWAS study reported a result with such a SNP that was not on their chip but had a SNP that was a perfect proxy. Without "imputing" you would

not know what your genotype was.

 

So, getting those extra 4 million SNPs for free does make quite a bit of sense. And it looks like the output would be of only high quality SNPs. 

I ran the program and got 40 million SNPs, though it makes so much more sense to only use the results that were highly likely to be accurate.

This is what genomic experts do.

 



#23 mag1

  • Topic Starter
  • Guest
  • 1,061 posts
  • 134
  • Location:virtual

Posted 30 September 2015 - 03:15 PM

This is amazing! All you have to do in the internet age is tell people what you want done, provide some of the technical background and someone

will do it for you. Very Awesome! 

 

It appears that one of my ideas has actually made it out into the world. I would think of the million or so people who have genotyped on 23andme, a fair few of them might be interested in 4 million more SNPs.

 

One suggestion I have is to move the computing for this onto the cloud. I am quite sure that there are some massive commercial cloud providers that

would be all too happy to sell you nearly unlimited computational power and storage. Such computer services likely would not be that expensive. Trying to do this all on your one server that can process 3 or 4 

gene chip files a day probably will not be enough. By getting onto the cloud you would have nearly unlimited ability to scale. You could impute

anywhere between 1 and 1 million gene chip files on the cloud with no problems. This would also be a better idea because with the cloud you could

do a massively parallel impute which might be able to generate a result almost instantly.

 

Thank you again for moving this idea forward!


Edited by mag1, 30 September 2015 - 03:17 PM.


#24 mag1

  • Topic Starter
  • Guest
  • 1,061 posts
  • 134
  • Location:virtual

Posted 03 October 2015 - 05:26 PM

This is just so amazing!

I have ideas that I float around all the time and generally they never go anywhere.

Well, actually they have never ever went anywhere!

 

Until now.

 

I put all the pieces of the puzzles on the table to make it easy for someone to put it together.

All it took was the final step and this idea could take off. It is the old story you need to have

as few hurdles to success as possible or something will trip people up. With this it wasn't only doing the imputation.

Impute2 will do an imputation almost right away. The important thing is to ensure that proper quality control is 

done and the imputed results are correct.

 

Yes, I am feeling quite nice about how all of this has turned out.

Imputation is an important and widely used technique in the scientific community and now there is an online service provider that can offer it to the public.

 

 

I like to think that every once in a while I can make a contribution to the advancement of human civilization.

This project has now made such a contribution to humanity.

 

It would not be unexpected if I will now start to receive substantial recognition on the forum for the part that I have played in moving this glorious achievement to fruition. As I mentioned playing a part in a team is the way things are usually achieved in life. There were a few different skills required to move imputation over the line. No one person would likely have all that would be required for success. I suppose some positive ratings would now be in order. Perhaps a new rating could be introduced that highlighted an idea or project suggestion that not only was proposed but also implemented! The rating might be described as a world changer!

 

 

 

I have now received my imputed results from the link above and I am quite happy.

There are now millions and millions of imputations!

Many of these results can now be used to check my exome variant file!

 

 

As a suggestion for anyone thinking of doing an exome scan, it would be wise to insist that

the scan was coupled with a cross-referenced imputation. Many of the most interesting results form the exome scan were

of low quality. Quite a few of these low quality variants had high number of reads though for various reasons ( strand bias etc. )

were still considered suspect. With a simple and inexpensive imputation, one can now cross-reference the exome scan against

the imputation. It is not obvious to me why this would not be considered the standard of practice.

 

And of course taking the imputation file and running it on Promethease should now generate a massive number of new health insights.

 

 

This is all still being worked out so there are a few remaining snags yet present.

It is currently not clear whether the file is actually in a form that Promethease could read.

This will be an easy fix. There are just a few formatting issues that might be a problem.

The only one that would need to be fixed at the program level is the inclusion of NNs as genotypes in the imputation files.

These were the non-imputables. It is good that they are included, though this might throw off Promethease.

The simple workaround for them is to simply make them comments in the file.

 

Including imputation of mitochondrial DNA would also be welcome. However, for some reason there is a farily limited online

database of mt sequences.

 

 

Another issue that needs to be mentioned is that a fairly low threshold has been used to qualify as a call (90%).

Some have criticized the accuracy of 23andme's results based on the argument that the multiple comparison of

almost 1 million SNPs even at very high accuracy (99.9% +) will leave many errors. These errors will often be

the SNPs that are reported as having health significance. The multiple comparison problem will greatly

intensify with this imputation. The program is not providing the actual number of imputed SNPs (It would be helpful if it did.)

However, by using a line counter command on the smallest autosomal chromosome (number 22), I found that there were

almost half a million SNPs in the imputation. There appear to be many many millions of imputed SNPs. Possibly even more than 10 million. Though some of the genotypes would have a probability of being correct as low as 90%.

 

I am so happy this has finally happened.

It will be very helpful for the many people with gene chip files who want an estimate of the genotype of other SNPs not included on the chip.

Importantly, many of the SNPs reported in the Imputation file had an imputed genotype probability reported as 100%.

It should also be kept in mind that when the SNPs were chosen for the gene chips there was no way of knowing what SNPs would be found important for health traits. Of course there is also no guarantee that a SNP on the 23andme chip will be recognized by Promethease even when there is a 100% proxy for it. This imputation service could be a great help to many people.     

 

 

 

Humanity you are most welcome!

 

 

       

 

 

 

 

 

 


Edited by mag1, 03 October 2015 - 05:50 PM.

  • Enjoying the show x 1

#25 mag1

  • Topic Starter
  • Guest
  • 1,061 posts
  • 134
  • Location:virtual

Posted 04 October 2015 - 06:01 AM

I just determined the number of genotype calls in the gen file for my imputation from the above url for chromosome 20.

 

There were over 851,000 SNPs imputed for chromosome 20.

751,831 or 88.3% were imputed at 100% probability.

15,001 or 1.8% were imputed at 99% probability.

 

It was a little awkward running the program to determine this because there were some unknown regions called Merged-deletes that interfered.

(Not sure what they were.)

 

However, if an extrapolation is made using the chromosome 20 result onto the full genome, then it is expected that 53 million SNPs were imputed.

This would result in 47 million 100% probability imputed SNPs.

 

So this imputation resulted in a simply overwhelming number of high quality SNPs. This is approaching the total value of variants that would be expected 

from a full genome scan.

 

 

 



#26 niner

  • Guest
  • 16,276 posts
  • 2,000
  • Location:Philadelphia

Posted 05 October 2015 - 01:58 AM

Mag1, there's something I don't get about imputation.  They say that they know an imputed SNP with 100% probability, but if that's based on a statistical model from a couple thousand full genomes, how do they know that you don't have a mutation at that position?

 

Another question:  How many of these imputed SNPs are known well enough to be found in Promethease?



#27 mag1

  • Topic Starter
  • Guest
  • 1,061 posts
  • 134
  • Location:virtual

Posted 05 October 2015 - 02:36 AM

Imputation is probably picking up on the deep ancestry signal. The statistically most likely situation would be that if you had a mutation at a locus then others in your

family down the generations would also have this mutation and it would be found in the sequencing used for the reference sample. Word is that imputation using the

10K UK is even more accurate than the 1000 Genomes dataset. The reference samples are now picking up on fairly subtle haploblock patterns in the UK population. I am becoming increasingly unclear whether full genome sequencing will ever be necessary on a population scale. On the cool scale, well sure it is a must have, though more on a money wise smart scale, maybe not so much.

 

So the idea is that it is fairly remote that a mutation that you have would have arisen very recently. Accurate imputation would become almost impossible if genomes were especially mutation prone.  

 

It is hard to say about how many additional SNPs would be in Promethease as a result of moving to an imputed file. The nearly 50 million additional high quality SNPs that I have now starts to approach a full genome scan without the cost and without the numerous unavoidable sequencing artifacts that arise. In our exome scan most of the really interesting variants that gave me the most terror were likely simply errors. Out of the 60 million base pairs in the exome the 1 in 1000 variants that were reported were likely highly enriched with errors. The imputation now is clearly showing that many of these were not correct. I was going to spend a whole bunch more money trying to unwind all the errors from the exome scan, though now the $30 imputation has pretty much made that unnecessary. Insisting on a coupled exome scan imputation would make quite a bit of sense.

 

It is probably also important to realize that much of the imputation is sort of bogus. Many of the 100% SNPs are probably simply those with perfect proxies. You could go to the Broad site and figure these out for yourself. However, having it all wrapped up for you in one nice file makes everything so much easier. If some of these genomics companies would simply add the imputation into their service then it really would not matter very much about imputing. Why not simply tell the computer to report all SNPs that have a perfect proxy on the 23andme chip? As long as this is not done there will be an advantage in having your file imputed. Promethease has been fairly clear that they do not think of themselves as 23andme's back office. They are interested in serving a wide range of genomics customers including those with full exome, full genome and others.    



#28 niner

  • Guest
  • 16,276 posts
  • 2,000
  • Location:Philadelphia

Posted 05 October 2015 - 04:09 AM

The nearly 50 million additional high quality SNPs that I have now starts to approach a full genome scan without the cost and without the numerous unavoidable sequencing artifacts that arise. In our exome scan most of the really interesting variants that gave me the most terror were likely simply errors. Out of the 60 million base pairs in the exome the 1 in 1000 variants that were reported were likely highly enriched with errors. The imputation now is clearly showing that many of these were not correct. I was going to spend a whole bunch more money trying to unwind all the errors from the exome scan, though now the $30 imputation has pretty much made that unnecessary. Insisting on a coupled exome scan imputation would make quite a bit of sense.

 

But highly unusual variants are precisely the ones that imputation would miss, by definition.  Imputation will work well for segments of the genome that were modified by rearrangement, providing they are old enough to be widely represented in the entire human genome.  The place where it falls down is mutational events that are relatively recent.  (and by "relatively", I mean hundreds or maybe even a few thousand years)  I wouldn't toss out the exome data on the basis of the imputation.  The exome scan is a real measurement of your genome, while the imputation is telling you what the average human has at those loci.



#29 mag1

  • Topic Starter
  • Guest
  • 1,061 posts
  • 134
  • Location:virtual

Posted 13 October 2015 - 03:59 AM

snpedia is going to accept imputed files!

Their policy had recently been somewhat ambiguous. However in the last few days, there has been word that they will in fact be willing to run imputed files on Promethease.

 

Something that should also be noted is that with the level of sequencing currently being done, we are not far off the time in which everyone in the population will be within a few generations of someone with a full

genome scan. Imputation will no longer be in the context of possibly being thousands of years distant from such a relative. Perhaps for most people gene chips and imputation will result in a synthetic full genome

scan that is accurate enough.

 

The next service that I will be interested in is phasing.



#30 Danail Bulgaria

  • Guest
  • 2,213 posts
  • 421
  • Location:Bulgaria

Posted 13 October 2015 - 08:01 AM

Instead of waisting your time with junky science, why not you focus on stem cells?







Also tagged with one or more of these keywords: imputation, genetics, impute2, snp

1 user(s) are reading this topic

0 members, 1 guests, 0 anonymous users