The world’s largest cancer database – containing 1.7 billion experimental results – has been launched, with an aim to discover future treatments for the deadly disease.
The system, called CanSAR, is the biggest disease database of its kind anywhere in the world and condenses more data than would be generated by one million years of use of the Hubble space telescope.
It contains more than eight million experimentally derived measurements, nearly one million biologically active chemical compounds and data from over a thousand cancer cell lines.
The database will utilise artificial intelligence similar to the technology used to predict the weather to predict which potential drugs are likely to work in which circumstances.
The new CanSAR database developed by researchers at The Institute of Cancer Research, London, is more than double the size of a previous version and has been designed to cope with a huge expansion of data on cancer brought about by advances in DNA sequencing and other technologies.
The resource is being made freely available by The Institute of Cancer Research (ICR) and Cancer Research UK, and will help researchers worldwide make use of vast quantities of data, including data from patients, clinical trials and genetic, biochemical and pharmacological research.
Although the prototype of CanSAR was on a much smaller scale, it attracted 26,000 unique users in more than 70 countries around the world, and earlier this year was used to identify 46 potentially “druggable” cancer proteins that had previously been overlooked.
The new database will drive further dramatic advances in drug discovery by allowing researchers access to, and the ability to interact with, unprecedented amounts of multidisciplinary data in seconds.
CanSAR contains drug target information from the human genome and model organisms. Research that had previously taken months to complete will now take only minutes.
“The database is capable of extraordinarily complex virtual experiments drawing on information from patients, genetics, chemistry and other laboratory research. It can spot opportunities for future cancer treatments that no human eye could be expected to see,” Dr Bissan Al-Lazikani, Team Leader in Computational Biology and Chemogenomics at the ICR, said.
“This is an extraordinary time for cancer research, as advances in scientific techniques open up new possibilities and generate unprecedented amounts of data.
Our aim is to make this wealth of information, coming from both the clinic and from the laboratory, freely available in a very user-friendly form to as many people as possible,” Professor Paul Workman, Deputy Chief Executive of the ICR, said.