Roadmap
Our plan
Creating an integrated platform that connects public data, analysis tools, and community knowledge
Specific features planned for upcoming releases:
- Natural language query interface for ENA with automatic Galaxy integration
- Embedded Galaxy Pages (notebook-like documents containing workflow results) in relevant BRC Analytics pages
- Gene pages and gene sets with meaningful query and comparison capabilities
- Enhanced Galaxy workflows for comparative genomics and other analyses
- Priority pathogens/outbreaks tab highlighting specific organisms and relevant resources
- Integration with specialized tools like hgPhyloPlace from UCSC for phylogenetic analysis
- Importing gene model data from VEuPathDB
- Access to Galaxy's MCP server to provide a natural language interface to Galaxy itself
In building a collaborative environment where researchers can share insights and analyses, we aim to:
- Embed results of Galaxy analyses directly in the BRC Analytics site for community reference
- Empower users to self-curate organism knowledge bases through peer review
- Establish community forums for discussing research questions and methodologies
- Enable contribution of custom workflows, analysis pipelines and visualizations tailored to specific organisms
We recognize that each organism is different, with unique biology, research communities, and funding landscapes that drive diverse analytical needs. To address this, we intend to:
- Build a flexible platform that can adapt to the specific requirements of different organisms
- Accommodate varying levels of genomic resources, from well-studied model organisms to neglected pathogens
- Enable communities to prioritize the data types and analyses most relevant to their organism of interest
- Design adaptable interfaces that can evolve as research interests and priorities shift over time
- Support both emerging and established research communities with appropriate tools
While maintaining this flexibility, we will give special attention to NIAID-designated prototype pathogen viral families, biodefense priorities, and emerging threats.
We aim to create a unified experience that brings together diverse data sources and analysis capabilities. Our goals include:
- Automatically manage reference genomes and annotations from NCBI
- Provide an integrated and intuitive interface for ENA with automatic upload to Galaxy
- Import gene models and other data from public resources (e.g. VEuPathDB)
- Pre-build essential bioinformatics resources (indices, etc.) for supported organisms
- Create a comprehensive resource that reduces the need to navigate between multiple platforms
- Embed a wide diversity of analysis results from Galaxy as essential context for research on supported organisms
- Build a flexible system that can be extended to support additional organisms, data sources and analysis tools quickly
In order to make complex analyses accessible to researchers with varying computational expertise, we will:
- Leverage Galaxy's GUI-based platform, pre-built workflows and in-development MCP server, eliminating the need to learn command-line interfaces
- Utilize TACC's compute resources, removing the need for institutional or paid hardware and associated configuration
- Integrate curated, community-developed Galaxy workflows from the Intergalactic Workflow Commission (IWC)
- Create intuitive interfaces that bring together multiple resources in a single, unified experience
- Build on Galaxy team's strengths in tutorials and training networks to provide comprehensive educational resources
- Gather dedicated resources for priority pathogens in one place for streamlined research
- Implement LLM-powered natural language queries of data sources with automatic data upload to Galaxy