Hi all,
I am Nahuel, a biotechnologist writing from Argentina. I am fascinated by the power of this platform, really guys, I think you are shaping the future here and now.
I have an idea to develop (a project to propose), that I want to share here briefly and get help from whoever wants to be part of the project. I am bad at programming, I am learning, but I need help to code from whoever wants to get involved in the project.
My idea is to search for coherent messages in the human genome. So, I need to code some programs to address the goal in different steps. First (I guess) we would have to create a program capable of finding coherence between the nucleotides of a DNA sequence (eg AACTGGTACC) and the characters of the different alphabets (coherence for formal words). Here comes my contribution from the biological sciences. The way to do it, according to nature, is by a triplet, that is, to code a program to find the best coherent correlation between triplets of nucleotides and human alphabets in relation to that decoding finds in the genome (DNA nucleotide sequnces) the largest number of coherent words possible. I have chosen triplets, because nature works in nucleotide triplets, I mean, triplets of nucleotides codes for an amino acids. Just exist 4 nucleotides (named witha a letter: A, C, T, G) to encode all the genomes. If, for example, the AAC nucleotid triplet codes for amino acid 1, and the ATG nucleotid triplet codes for amino acid 2, then the genomic sequence AACAACATG codes for the protein formed by the amino acid sequence 112. In nature there are only 4 nicleotides (A, C , T, G) and 20 amino acids (really 22 but I do not want to extend here with details), which make up all the diversity of living beings that we know. The combination of the 4 nucleotides in triplets gives (4x4x4 = 64) 64 triplets that code for the 20 amino acids (22) (obviously some are redundant, that is, more than one triplet codes for an amino acid). Thus, it is interesting to note that the human alphabet that most closely resembles this scheme is Hebrew, it already has 22 characters. With which we could start trying to find correlations between DNA sequences and the Hebrew alphabet. I mean, code a program to find the best coherent correlation between the 64 DNA triplets and the characters of the Hebrew alphabet in order to form words when serching human DNA sequences. At least two (I suspect more maybe) programms are needed, one to assign DNA triplets to characters in the alphabet, and another to, once the best triplet/character translation relationship is found, look up words (and perhaps messages) in DNA sequences.
Is there a hidden message in the human genome? Why would there be? I honestly don’t know, but if there is, it would be very interesting. The human genome has 3.2 billion nucleotides, I think we will find something. But it does not make sense to search the entire genome, I can provide short nucleotide sequences to start the project, that have the greatest possible biological relevance (I do not want to expand, but there are many different regions and types of sequences to search, I would choose the most relevant).
The second thing would be to try to translate the genetic code into some formal language (programming or mathematics), since, DNA is a programming language, a sequence that reads a turing machine (the cell) to carry out specific functions. But here I stop because otherwise it becomes very extensive. I hope someone is interested in my proposal, help me and let’s start having fun with this. Alive organism, and DNA nature itself is amazing complex to happen by their own, if we are an alien experiment as many good scientist believe, maybe exist some kind of hide messaje in DNA. Please do not think that I believe we are an alien experimnt (I just do not undestand how life happen), but this is an interesting approach to that idea. A big greeting to all.