phytoai-assistant / README.md
TANTCHEU Noussi CΓ©dric
Initial space upload: Interactive PhytoAI Assistant
7603b2e

A newer version of the Streamlit SDK is available: 1.46.0

Upgrade
metadata
title: PhytoAI Assistant
emoji: 🌿
colorFrom: green
colorTo: blue
sdk: streamlit
sdk_version: 1.28.0
app_file: app.py
pinned: false
license: cc-by-4.0
tags:
  - phytotherapy
  - natural-compounds
  - bioactivity
  - drug-discovery
  - ai-assistant
  - research-tool

PhytoAI Assistant 🌿

Interactive AI Assistant for Phytotherapy Research - Explore natural compounds and their bioactivities using cutting-edge AI technology.

🎯 Overview

The PhytoAI Assistant is an interactive web application built with Streamlit that provides researchers, students, and pharmaceutical professionals with easy access to a comprehensive database of natural compounds and their documented bioactivities. This tool leverages the PhytoAI MEGA Dataset containing 352 unique natural compounds and 1,314 bioactivities.

✨ Key Features

πŸ” Advanced Search Capabilities

  • Compound Name Search: Find specific natural compounds by name (e.g., curcumin, resveratrol, quercetin)
  • Therapeutic Activity Search: Discover compounds by their therapeutic properties:
    • Anti-inflammatory
    • Antioxidant
    • Cardiovascular protective
    • Neuroprotective
    • Anti-cancer
    • Antimicrobial

πŸ“Š Interactive Data Visualization

  • Real-time Statistics: Live metrics on compound count, bioactivities, and therapeutic coverage
  • Distribution Charts: Visual analysis of therapeutic activities using Plotly
  • Pie Charts: Therapeutic area distribution for quick insights
  • Bar Charts: Activity type frequency analysis

🧬 Comprehensive Compound Information

  • Molecular Properties: Chemical formulas, SMILES notation, molecular weights
  • Database Cross-references: PubChem CID mappings for further research
  • Bioactivity Profiles: Detailed activity descriptions with experimental context
  • Literature References: Links to original research and validation studies

πŸš€ How to Use

  1. Launch the Application: The interface loads automatically with dataset statistics
  2. Search Compounds: Use the sidebar to search by compound name or therapeutic activity
  3. Explore Results: Click on compound cards to see detailed molecular and bioactivity information
  4. Analyze Data: Review interactive charts to understand therapeutic distribution patterns
  5. Cross-reference: Use PubChem CIDs for additional research in external databases

πŸ“ˆ Dataset Integration

This application seamlessly integrates with the PhytoAI MEGA Dataset through Hugging Face's hf_hub_download functionality, ensuring:

  • Always Up-to-date: Automatic synchronization with the latest dataset version
  • Efficient Loading: Cached data loading for optimal performance
  • Reliable Access: Robust error handling and fallback mechanisms

πŸ”¬ Research Applications

Academic Research

  • Drug Discovery: Identify promising natural compounds for pharmaceutical development
  • Ethnopharmacology: Validate traditional medicine uses with modern bioactivity data
  • Chemical Biology: Explore structure-activity relationships in natural products

Pharmaceutical Industry

  • Lead Compound Identification: Screen natural products for specific therapeutic targets
  • Bioactivity Prediction: Use existing data to guide synthetic chemistry efforts
  • Competitive Intelligence: Monitor natural product research trends and opportunities

Educational Use

  • Teaching Tool: Interactive exploration of phytochemistry and pharmacology concepts
  • Student Projects: Real-world dataset for bioinformatics and cheminformatics training
  • Research Training: Hands-on experience with pharmaceutical data analysis

πŸ› οΈ Technical Architecture

Frontend

  • Streamlit: Modern, responsive web interface
  • Plotly: Interactive data visualizations
  • Pandas: Efficient data manipulation and analysis

Backend

  • Hugging Face Hub: Dataset storage and version control
  • JSON/CSV: Structured data formats optimized for research
  • Caching: Optimized performance with Streamlit's caching system

Data Pipeline

Pandas Processing β†’ Streamlit Interface β†’ Interactive Visualizations

πŸ“Š Dataset Statistics

  • 🧬 Compounds: 352 unique natural products
  • πŸ”¬ Bioactivities: 1,314 documented activities
  • 🎯 Therapeutic Areas: 6+ major categories
  • πŸ“š Sources: PubChem, ChEMBL, peer-reviewed literature
  • πŸ”„ Updates: Regularly maintained and expanded

🌐 Related Resources

πŸ† Impact & Recognition

This tool has been designed to bridge the gap between traditional phytotherapy knowledge and modern AI-driven drug discovery, providing researchers worldwide with:

  • Accessible Data: User-friendly interface for complex bioactivity data
  • Research Acceleration: Rapid compound screening and hypothesis generation
  • Global Collaboration: Shared platform for international research initiatives
  • Educational Value: Training resource for next-generation researchers

πŸ“„ Citation & License

License: CC BY 4.0 - Free for academic and commercial use with attribution

Citation: If you use PhytoAI Assistant in your research, please cite:

@software{phytoai_assistant_2025,
  title={PhytoAI Assistant: Interactive AI Tool for Phytotherapy Research},
  author={Tantcheu, Cedric},
  year={2025},
  url={https://huggingface.co/spaces/Gatescrispy/phytoai-assistant},
  note={Interactive Streamlit application for natural compound bioactivity exploration}
}

πŸ‘¨β€πŸ’» About the Developer

Cedric Tantcheu - AI & Phytotherapy Research Specialist

  • πŸŽ“ Expertise in cheminformatics, machine learning, and natural product research
  • πŸ”¬ Focus on AI-driven drug discovery from traditional medicine
  • 🌍 Committed to open science and global health solutions

🌿 Advancing phytotherapy research through intelligent technology

Built with ❀️ for the global research community