Novel ACE-I Inhibitory Peptides from L. platensis
BioBloom is a research pipeline focused on mining microalgae proteomes to find bioactive peptides, specifically targeting ACE inhibitors for cardiovascular health.
This repository contains scripts used throughout our screening workflow—from querying databases and running ADMET predictions to generating 3D structures for molecular docking.
- Data mining: Extracting proteomes and deduplicating sequences.
- Bioactivity screening: Automating BIOPEP-UWM queries via Selenium.
- ADMET profiling: Cleaning and unifying results from AdmetLab 3.0.
- Cheminformatics: Converting formats (FASTA ⇄ SMILES ⇄ PDB).
- Machine Learning: Formatting data for the pLM4ACE model.
- Structure prep & docking: Using PyRosetta to generate peptide conformations and prepare the ACE receptor.
Automates batch processing for the BIOPEP-UWM database.
selenium_biopep_batch_processing.py– Screens for ACE inhibitory activity.selenium_batch_processing_scraper.py– Scrapes enzyme action analysis.search_for_novel_peptides.py– Compares outputs against known ACE inhibitors.unify_a_platensis_biopep_output.py– Merges and deduplicates species-specific outputs.
unify_admet_output_a_platensis.py– Merges ADMET screening outputs into a unified dataset.
Prepares 3D structures for docking.
generate_peptide_structures_pyrosetta.py– Builds PDBs from FASTA sequences using PyRosetta.repack_receptor_pyrosetta.py– Repacks side-chains of the ACE receptor.place_pep_into_ace.py– Places the peptide directly into the binding pocket.select_top_peptides_for_molecular_docking.py– Ranks and filters peptides based on ADMET criteria.
Utilities for the pLM4ACE predictive model.
prepare_pLM4ACE_input.py&split_pLM4ACE_input.py– Prepares and batches input files.unify_pLM4ACE_output.py– Merges model predictions.
smiles_converter.py– Bidirectional FASTA ⇄ SMILES conversion.create_fasta_input_for_smiles_conversion.py– Preps input FASTA files.extract_smiles_without_names.py– Cleans SMILES files (keeps strings, drops labels).
- Python 3.9+
- PyRosetta (requires license and manual installation, see docs)
- Chrome & Chromedriver (managed via
webdriver-manager)
pip install biopython selenium tqdm openpyxl pandas numpy rdkit-pypi