Please enable JavaScript to access the full functionality of this site.

History

The Explosion and Evolution of Chemical Biology Data

BARD’s challenge is complex, but its goal is quite simple: To unlock the full potential and value of years of research to empower scientists and benefit all mankind.

Consider how far technology has come in the eight years since the founding of the National Institute of Health’s Molecular Libraries Program (MLP): Computers with amazing power, datacenters with staggering capacity, all connected by a robust and mature Internet. Just as technology continuously transforms itself, so it transforms the work of researchers.

Since 2005, the MLP has endeavored to harness technology to expand and catalog the relationships between individual chemical entities and biological functions. Through a network of High Throughput Screening (HTS) centers, the MLP has generated reams of bioassay data over the course of thousands of screening campaigns, many of which producing promising probes. However, this data, published with little standardization to PubChem, tends to be limited by the moment, place and technology in which it was created.

Over the same eight years, numerous other public and private online databases have appeared, each with their own ontologies and blend of various data: chemical structures, patent information, compound activity, affinity data, biological activity, and others. Also, pharmaceutical companies have begun developing their own proprietary systems to integrate various types of data to meet their specific needs.

For the biomedical research community, all this promising technology has resulted in a labyrinth of resources and overly complex tools. Also, publically shared data too often lacks the proper contextual information for interpretation. Worse still, it lacks adequate quality controls (e.g., five established databases disagree on the structure of 150 top-selling drugs.) Lastly, pharmaceuticals is a tight-lipped business where even trace evidence of multimillion dollar IP left in the public domain can compromise a company and end careers. The result is an environment that is inhospitable for empirical science, informed business strategy or unfettered exploration.

So while technology enables us to gather exponentially more data, at the same time it can obscure valuable information and true knowledge in a haze of noise that makes it difficult to know what is known and discern what is significant. But not anymore, thanks to BARD.