Paper Discovery Feed

Explore domain hubs and recent submissions

PDF Preview

We train protein language models up to 15B parameters and find that as models scale, information emerges in the representations that enables accurate atomic-resolution structure prediction.

CodeRead PDFarXiv:2207.06616
PDF Preview

We present scGPT, a generative pretrained transformer model for single-cell biology that enables cell type annotation, multi-batch integration, and perturbation response prediction.

CodeRead PDFarXiv:2302.02867
PDF Preview

We generate gene embeddings by converting NCBI gene summaries into vector representations using GPT-3.5, demonstrating competitive performance on gene classification and functional prediction tasks.

CodeRead PDFarXiv:2306.15462
PDF Preview

We present the AlphaFold DB, providing open access to 200 million protein structure predictions, covering nearly all catalogued proteins known to science.

CodeRead PDFarXiv:2209.15474