Operation on files
Certain file types can be combined to integrate information from multiple sources. xyna.bio allows for the following file operations:
Load Fasta
The fasta node automatically infers which type of data is contained in which file and generates the correct type of reference to be used with other nodes.
Input: file name
Output: fasta file in one of the formats (protein, nucleic acid, aligned protein, aligned nucleic acid, multi protein, multi nucleic acid).
Load GenBank
Converts a GenBank file into a fasta file. This node exports the gene sequence from the GenBank file to a gene fasta file while preserving the annotations in the file’s selections.
Input: GenBank file
Output: fasta file
Load PDB
Load a protein structure in PDB format.
Input: file name
Output: PDB file
Load SDF
Load a molecular structure in SDF format.
Input: file name
Output: SDF file
Smiles to structure
This node converts a SMILES (Simplified molecular-input line-entry system) string, given as text, into an SDF format structure. The SDF file format stores three-dimensional structural data of molecules like drugs or metabolites. (see Additional file formats).
The optimize option determines if xyna.bio should optimize the three-dimensional geometry of the molecule using molecular mechanics. This can be useful to obtain a more accurate three-dimensional structure of the molecule.
Input: SMILES string in text form
Output: SDF file
Add SDF to PDB
Adds a SDF structure to a PDB structure. Returns a PDB file with the added SDF
ligand.
Note: this does not perform any docking, the coordinates of both the protein and ligand are conserved as is.
To
position the ligand along the protein structure, please perform docking beforehand.
Input: PDB file, SDF file
Output: PDB file
Merge Fasta
Merges two protein fasta sequences into one multi fasta file.
Input: fasta file
Output: fasta file
Extract Selection from Structure
Gets a substructure of a given PDB file based upon the name of a previously defined selection. Returns all amino acids within the selection range as a new PDB structure. Does not return water molecules or other types of atoms present in the PDB file.
Input: PDB file
Output: PDB file of substructure
Extract Selection from Fasta
Extracts a subsequence from a fasta sequence based upon a selection name of a previously defined selection (see Annotation Nodes).
Input: fasta file
Output: fasta file of subsequence