In my present job as Community Information Assistant at Ilfracombe Library, I am involved in maintaining and searching a database of local organisations. This constantly involves me in the structures of relationship between different organisations and reminds me of the differences between database structures and hypertext structures. My current web pages were inspired by my inability to construct an effective bibliography with a database. I am currently looking at ways in which databases and hypertext can be linked.
This does not mean that any classification system is as good as any other. Venter's classification is partially based on the a posteriori classifications given to the proteins by their researchers. These researchers that know some of the pathways in which these proteins are involved within the cellular context. The evidence does suggest that both hierarchies and webs are involved in cellular organisation. In this way, ecological approaches to multi-cellularity might ask informative questions.
Translating this problem back to the genotype-phenotype question, can we ask - how is our set of protein forms related to our set of protein functions? If we take as given that genes determine protein form can we also assume that genes determine all protein functions? To answer this question we must assume that protein function is independent of protein form, and then see how the two relate. The number of genes involved in human genome makes computational methods necessary.
In computational approaches we must ensure that the assumption of form-function independence is preserved so that the results will be a property of the data and not a computational effect. This can therefore be expressed as a problem in bio-informatics - how can we classify a database so that the functionally properties that we wish to study are preserved? In this way, computational and linguistic approaches to multi-cellularity might ask informative questions.
Venter et al. (1995) "Initial assessment of human gene diversity and expression patterns based upon 83 million nucleotides of cDNA sequence", Nature 377 Suppl., pp. 3-174