cag VirAIPowered by Deep Learning

Predict whether a virus infects insects, mammals, plants or birds using state-of-the-art 1D CNN Deep Learning model and advanced sequence analysis.

Engineered for Precision

A purpose-built pipeline combining multiple machine learning models with advanced bioinformatics for reliable predictions.

Fast Predictions

Get results in seconds with our optimized DL pipeline

High Accuracy

Our 1D CNN model ensure reliable predictions

1D CNN Model

Captures local motifs and global sequence pattern

Total Fragments for Model Training

600bp - 346738 fragments of 3948 unique viral tax ids

1D CNN Model Valiadtion Accuracy

Our hierarchical multi-task convolutional neural network (CNN) trained to predict viral taxonomy & host associations directly from 600-nucleotide RNA viral genome fragments. Our model prediction has minimum length threshold of 200bp to complete genome.

🦟

Insect

Host Class

Model Accuracy94.2%

Identifies genomic signatures specific to viruses that infect insect.

🦠

Mammal

Host Class

Model Accuracy94.2%

Identifies genomic signatures specific to viruses that infect mammal.

🌱

Plant

Host Class

Model Accuracy94.2%

Identifies genomic signatures specific to viruses that infect plant.

🦅

Avian

Host Class

Model Accuracy94.2%

Identifies genomic signatures specific to viruses that infect avian.

Quick Start

Run Your First Prediction

No Data? No Problem

Use our curated example sequence to test the platform's full analysis pipeline.

Full Analysis in <10s

See feature extraction, model inference, and confidence scoring in real-time.

example.fasta
NC_123456.1
# Arabidopsis thaliana virus
>NC_123456.1 | Plant virus
ATGCGTACGATCGATCGATCGATCGATCGATCGATCGATCGATCGATCGATCGATCGATC GATCGATCGATCGATCGATCGATCGATCGATCGATCGATCGATCGATCGATCGATCGATC ATGCGTACGATCGATCGATCGATCGATCGATCGATCGATCGATCGATCGATCGATCGATC GATCGATCGATCGATCGATCGATCGATCGATCGATCGATCGATCGATCGATCGATCGATC >Example_Virus_Sequence_2 | Another sample sequence GCTAGCTAGCTAGCTAGCTAGCTAGCTAGCTAGCTAGCTAGCTAGCTAGCTAGCTAGCTA
... 2,834 bp sequence
Length: 2.8k bp
GC: 42.5%
Predicted: Plant
Research-Grade Accuracy

Ready to Classify Your Sequences?

Upload FASTA files or paste sequences directly. Our ensemble model provides host predictions with confidence scores and detailed feature analysis.

Supported formats: FASTA, raw sequence • Max file size: 50MB