DATABASE INFORMATION

Mosaic Virus Database is a user friendly information library.The importance of the database lies in the fact that this is the first database on the web which is specially and specifically for mosaic viruses.The user can get all the information like ORF Details, Genome Information in an integrated manner through this library.

The need for such a database lies in the fact that now for information in any sphere for any mosaic viruses the user wont have to go on different variety of sites and softwares.Here the user gets all the required data about the viruses under one roof.

Mosaic virus database is an online library of all the types of mosaic viruse's information.The database provides ready access to all the scientific information on all the mosaic viruses.

The main objective of MOSVIS is to provide a comprehensive,ready to use,searchable web-based full information database of all the Mosaic viruses discovered so far.

Alll the data in the libraray has been collected using the following mehodology.

>DATA RETRIEVAL:
Genome information of mosaic virus is retrieved from the NCBI (National Center of Biotechnological Information). Genome information includes accession number, GenBank id, taxonomy, host, strain, topology, length, etc. From this genomic information we prepared the genome database.

>GENOME DATABASE:
Accession no,Genbank ID,Acronym,Strain,Isolate,Host,Taxon,Molecule,Topology,Length,GC%

>DEFINITIONS:
Accession No. :
An accession number in bioinformatics is a unique identifier given to a DNA or protein sequence record to allow for tracking of different versions of that sequence.
GenBank :
The GenBank sequence database is an open access, annotated collection of all publicly available nucleotide sequences and their protein translations. This database is produced at National Center for Biotechnology Information (NCBI). GenBank and its collaborators receive sequences produced in laboratories throughout the world from more than 100,000 distinct organisms.
Acronym:
An acronym is a word formed from the first letters of a series of words that is used as an abbreviation to refer to that series of words. For example; acronym used for Tobacco Mosaic Virus is TMV.
Strain:
Strain is a group of organisms within a species that differ in trivial ways from similar groups.
Isolate:
Isolate is a required single organism separated from the whole population.
Host:
Host is an animal or plant that nourishes and supports a parasite.
Taxon :
Taxon is a classification or group of organisms (ie, kingdom, phylum, class, order, family, genus, species).
Topology:
It is the characteristic of DNA or RNA in an organism. E.g.; circular or linear.

>ORF DETAILS:
ORF details are retrieved with the use of ORF finder which is a tool present on NCBI database. FASTA format of the genome sequence of organism for the Orf details has been used. It includes reading frames, co-ordinates, length of nucleotide, length of of amino acids, nucleotide sequence, and amino acid sequence.

Accession no..ORF_id,frame,coordinates,Length_NT,Length_aa,NT_sequence,AA_sequence

ORFs:
Open reading frames are the reading frames where successive nucleotide triplets can be read as codons specifying amino acids and where the sequence of these triplets is not interrupted by stop codons.
ORF finder:
It identifies all possible ORFs in a DNA sequence by locating the standard and alternative stop and start codons.
>Properties of nucleotide sequence:
ORF_id,MW_ss,MW_ds,GC%,AT%,NT composition

For knowing the properties of nucleotide sequence BioEdit software is being used.

>BIOEDIT:
BioEdit is a biological sequence alignment editor.  An intuitive multiple document interface with convenient features makes alignment and manipulation of sequences relatively easy. Several sequence manipulation and analysis options and links to external analysis programs facilitate a working environment which allows one to view and manipulate sequences with simple point-and-click operations. It is intended to provide basic functions for protein and nucleic sequence editing, alignment, manipulation and analysis. Bioedit is used for the nucleotide sequence information that includes molecular weight, GC%, AT%, Nucleotide composition.

Various tools and softwares like MySQL,Macromedia Dreamweaver,Eclipse IDE,Apache Tomcat web server have been used to compile the entire Data collected and the overall project developement.