Web Date: December 13, 2011
PubChem Gets More Data
IBM announced plans to give the National Institutes of Health a database of more than 2.4 million chemical compounds. NIH will add this information to PubChem, a freely available database of chemical structures of small organic molecules and information on their biological activities.
The chemical compound data was pulled from about 4.7 million patents and 11 million biomedical journal abstracts from 1976–2000. According to IBM, the newly compiled data will help researchers more easily visualize important relationships among chemical compounds and aid in drug discovery.
IBM created the database in collaboration with AstraZeneca, Bristol-Myers Squibb, DuPont, and Pfizer. The compound data was extracted using IBM’s strategic IP insight platform, a combination of data and analytics delivered by the IBM SmartCloud. The platform uses techniques such as automatic image analysis and enhanced optical recognition of chemical images and symbols.
- Chemical & Engineering News
- ISSN 0009-2347
- Copyright © American Chemical Society