There are more 64-amino acid sequences than atoms in the universe. The vast majority of these are non-functional.
OpenProtein.AI maps the sequence universe to focus only on the tiny subspace of functional proteins.
Traditional protein engineering approaches limit searches to known cladistic branches. These approaches are slow, expensive, and miss novel variants from families distant to original sequences.
OpenProtein.AI learns from natural proteins to generate novel functional protein sequences, covering a much broader range of the cladistic sequence tree around any starting amino acid sequence.
OpenProtein.AI designs focused mutagenesis libraries that reflect functional constraints and epistasis, yielding higher success rates. No protein structures required.
OpenProtein.AI achieves state-of-the-art function prediction accuracy for any protein, for any property, for anybody.
Our iterative design framework integrates with your assays to predict function from sequence. Our model adapts to any design goal you have, and learns from your functional measurements in a continuously self-improving cycle.
Seamless integration with your assay data enables OpenProtein.AI to optimize your functional properties directly. No surrogate properties like stability or binding affinity required.
OpenProtein.AI can optimize multiple properties simultaneously. Consider all of your properties earlier in the engineering process.
Bepler, T. and Berger, B., 2021. Learning the protein language: Evolution, structure, and function. Cell systems, 12(6), pp.654-669.
Bepler, T. and Berger, B., 2019. Learning protein sequence embeddings using information from structure. ICLR 2019.
Ram, S. and Bepler, T., 2022. Few Shot Protein Generation. arXiv preprint arXiv:2204.01168.
Large scale deep learning models are unlocking new capabilities in protein engineering, but these models are expensive to deploy and require teams to develop. Our democratized platform brings state-of-the-art machine learning tech to the bench, with a web-based application that is accessible to non-computer scientists. No expensive infrastructure, or setup, or specialized skill-sets required.
We offer free access to academic researchers, and commercial options range from platform access to partnerships.
Derive insights from your mutagenesis data to design optimized libraries, accelerate design-build-test iterations, and track your progress all in one easy-to-use web app.
We work directly with your team and data to understand your protein, assay, and design goals. We specialize our platform to your use case and deliver custom libraries designed to meet your specifications. You build and assay those sequences in your system.
Tristan is a machine learning scientist, group leader of the Simons Machine Learning Center at the New York Structural Biology Center, and CEO and co-founder of OpenProtein.AI. He received his PhD from MIT in the Computational and Systems Biology Program. Before starting OpenProtein.AI, Tristan pioneered large language models for learning protein sequence representations and their application to protein property prediction. He is also passionate about machine learning methods for understanding protein structures in native states and accelerating cryo-EM.
Dr. Lu is a serial biotech entrepreneur and faculty member in Electrical Engineering and Computer Science, Biological Engineering at MIT. Dr. Lu has been a co-founder and a Scientific Advisory Board member to a number of biotechnology and biopharmaceutical companies, including Senti Bio, BiomX, Corvium, Eligo Bioscience, Engine Biosciences, Synlogic and Tango Therapeutics.