Databank descriptors
What is a databank descriptor ?
A databank descriptor is a text file containing the instructions used by BeeDeeM to download and install databanks.
There are two types of descriptor:
bank descriptor (.dsc file): descriptor used to describe the installation of a single databank
global descriptor (.gd file): descriptor used to start the installation of one or several databanks
Description of databases to be managed: the bank descriptor
By default, BeeDeeM provides a non-exhaustive list of descriptors for processing various sequence databanks and biological classifications (ontologies). All of these files are suffixed with extension “.dsc” and are located in ${conf} directory.
Each file contains a group of instructions used by BeeDeeM:
to download (via FTP) all files making up the complete distribution of a database
to process the downloaded files to make them usable (decompressing, un-archiving, indexing, etc.)
Here is a sample bank descriptor aims at installing Uniprot_SwissProt:
The use of such a file will be explained in the next section.
The full format of the database descriptors is documented in section Databank descriptor format.
Description of processing to be performed: the global descriptor
The processing that BeeDeeM will perform is described in a global descriptor.
Here is an example of such a descriptor:
By default, BeeDeeM has a “test” descriptor for processing the installation of PDB Protein.
This descriptor is the file named 'test.gd' located in the directory ${conf}.
Note: We will use this file 'test.gd' in the rest of this manual to explain how to use BeeDeeM. However, you can create other descriptors (e.g. by deriving them from 'test.gd'), but always be sure to save them in the directory ${conf}.
Before starting any processing, it is VERY IMPORTANT to check the following two lines in the global descriptor:
The first line is a comma separated list of database descriptors to use (without their ".dsc" extension). It defines which databank(s) will be installed during a single BeeDeeM processing.
The second line gives a restart date. This line is only used in the case of a restart after a failure. If you start BeeDeeM for the first time or if you are updating the databases, it is absolutely imperative to set "resume.date" to the value none. All of this is explained in section Advanced uses.
But now, let's see how to install a databank using these descriptors!
Last updated
Was this helpful?