Unlocking Drug Discovery with AI-Powered Literature Mining
![BioMiner systematically harvests large-scale bioactivity data-demonstrating a significant reduction in time and cost compared to manual curation of datasets like PDBbind v2016-and this expanded dataset, heavily skewed towards a limited distribution of proteins, substantially improves deep learning model performance when used for pre-training, as evidenced by consistent gains across five independent runs [latex] (n=5) [/latex].](https://arxiv.org/html/2604.21508v1/x3.png)
A new system leverages artificial intelligence to automatically extract crucial protein-ligand interaction data from scientific publications, accelerating the pace of pharmaceutical research.




![OptiMat Alloys demonstrates an intelligent interface capable of comparing the elastic properties of complex alloys-specifically [latex] BCC Co_4Cr_{10}Fe_5Mo_{11}Ni_5W_{19} [/latex] and equiatomic [latex] FCC CoCrFeNi [/latex]-by retrieving and averaging data from a cached database of 10 configurations per composition, all achieved through natural language querying without requiring additional computational simulations.](https://arxiv.org/html/2604.21850v1/fig6_query.png)
