Comment on DNAddy

rockSlayer@lemmy.blahaj.zone 5 hours ago

I’m a data analyst at a medical nonprofit, primarily doing analyses on germline variants for rare forms of cancer. I’m new to this kind of work, but I have a decent educational background in biology.

Something I’ve learned is that genetics is complicated as hell. A single gene can produce multiple different proteins, and proteins can change over time due to somatic variation. Only about 1% of the genome is protein-coding; those regions are the exons, collectively called the exome. The exome can be affected by variants in start and stop codons, non-coding regions, and untranslated regions. There are entire fields dedicated to genomics, exomics, transcriptomics, proteomics, phenomics, and probably several others that I don’t know about. The amount of data involved in these fields is in the tebibyte range. Have you ever seen a “small” 3 GiB CSV? I have. The filtered and cleaned data frames that come out of genetic analyses are over 100 columns wide and have nearly 5 million entries.

There are companies creating artificial life by generating custom chromosomes. There’s a whole field of computer science dedicated to biological computing, using DNA as a storage medium. There are companies dedicated to simply classifying genes.

DNA is cool as hell.
