bpp-popgen  2.1.0
bpp::PolymorphismSequenceContainerTools Class Reference

Utility functions for manipulating PolymorphismSequenceContainer objects.

#include <Bpp/PopGen/PolymorphismSequenceContainerTools.h>


Public Member Functions

 ~PolymorphismSequenceContainerTools ()

Static Public Member Functions

static PolymorphismSequenceContainer * read (const std::string &path, const Alphabet *alpha) throw (Exception)
 Read a Mase+ file and return a PolymorphismSequenceContainer. A sequence is flagged as outgroup when its selection tag begins with OUTGROUP (see Polymorphix).
static PolymorphismSequenceContainer * extractIngroup (const PolymorphismSequenceContainer &psc) throw (Exception)
 Extract ingroup sequences from a PolymorphismSequenceContainer and create a new one.
static PolymorphismSequenceContainer * extractOutgroup (const PolymorphismSequenceContainer &psc) throw (Exception)
 Extract outgroup sequences from a PolymorphismSequenceContainer and create a new one.
static PolymorphismSequenceContainer * extractGroup (const PolymorphismSequenceContainer &psc, size_t group_id) throw (Exception)
 Extract a specific group from the PolymorphismSequenceContainer.
static PolymorphismSequenceContainer * getSelectedSequences (const PolymorphismSequenceContainer &psc, const SequenceSelection &ss)
 Extract selected sequences.
static PolymorphismSequenceContainer * sample (const PolymorphismSequenceContainer &psc, size_t n, bool replace=true)
 Get a random set of sequences.
static PolymorphismSequenceContainer * getSitesWithoutGaps (const PolymorphismSequenceContainer &psc)
 Retrieve sites without gaps from a PolymorphismSequenceContainer.
static size_t getNumberOfNonGapSites (const PolymorphismSequenceContainer &psc, bool ingroup) throw (Exception)
 Return the number of sites without gaps in a PolymorphismSequenceContainer.
static size_t getNumberOfCompleteSites (const PolymorphismSequenceContainer &psc, bool ingroup) throw (Exception)
 Return the number of completely resolved sites in a PolymorphismSequenceContainer.
static PolymorphismSequenceContainer * getCompleteSites (const PolymorphismSequenceContainer &psc)
 Retrieve complete sites from a PolymorphismSequenceContainer.
static PolymorphismSequenceContainer * excludeFlankingGap (const PolymorphismSequenceContainer &psc)
 Exclude flanking sites that contain gaps, but keep gap sites within the alignment.
static PolymorphismSequenceContainer * getSelectedSites (const PolymorphismSequenceContainer &psc, const std::string &setName, bool phase)
 Get a PolymorphismSequenceContainer corresponding to a site selection annotated in the mase comments.
static PolymorphismSequenceContainer * getNonCodingSites (const PolymorphismSequenceContainer &psc, const std::string &setName)
 Retrieve non-coding sites defined in the mase file header.
static PolymorphismSequenceContainer * getOnePosition (const PolymorphismSequenceContainer &psc, const std::string &setName, size_t pos)
 Retrieve sites at one codon position (1, 2 or 3).
static PolymorphismSequenceContainer * getIntrons (const PolymorphismSequenceContainer &psc, const std::string &setName, const CodonAlphabet *ca)
 Retrieve intron sites.
static PolymorphismSequenceContainer * get5Prime (const PolymorphismSequenceContainer &psc, const std::string &setName)
 Retrieve 5' sites.
static PolymorphismSequenceContainer * get3Prime (const PolymorphismSequenceContainer &psc, const std::string &setName, const CodonAlphabet *ca)
 Retrieve 3' sites.
static std::string getIngroupSpeciesName (const PolymorphismSequenceContainer &psc)
 Get the species name of the ingroup.

Detailed Description

Utility functions for manipulating PolymorphismSequenceContainer objects.

Author:
Sylvain Gaillard

Definition at line 71 of file PolymorphismSequenceContainerTools.h.


Constructor & Destructor Documentation


Member Function Documentation

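PolymorphismSequenceContainer * PolymorphismSequenceContainerTools::extractGroup ( const PolymorphismSequenceContainer &  psc,
size_t  group_id 
) throw (Exception) [static]

Extract a specific group from the PolymorphismSequenceContainer.

Parameters:
psc       a PolymorphismSequenceContainer reference
group_id  the group identifier, as a size_t
Exceptions:
GroupNotFoundException  if group_id is not found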

Definition at line 149 of file PolymorphismSequenceContainerTools.cpp.

References bpp::PolymorphismSequenceContainer::deleteSequence().
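
The filtering described above can be sketched without the library. This is only an illustration of the idea, not the Bio++ implementation: TaggedSequence and extractGroupSketch are hypothetical names, a std::vector stands in for the container, and std::out_of_range stands in for GroupNotFoundException.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical stand-in for one sequence of a PolymorphismSequenceContainer.
struct TaggedSequence {
  std::string name;
  std::string residues;
  std::size_t groupId;
};

// Return a new collection holding only the sequences tagged with group_id.
// Throws std::out_of_range (assumed stand-in for GroupNotFoundException)
// when no sequence carries that identifier.
std::vector<TaggedSequence> extractGroupSketch(
    const std::vector<TaggedSequence>& sequences, std::size_t group_id) {
  std::vector<TaggedSequence> kept;
  for (const TaggedSequence& s : sequences) {
    if (s.groupId == group_id) kept.push_back(s);
  }
  if (kept.empty()) throw std::out_of_range("group_id not found");
  assert(!kept.empty());  // postcondition: at least one sequence survived
  return kept;
}
```

The input is left untouched and a new collection is returned, which mirrors the "create a new one" behaviour documented for the extract functions.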

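PolymorphismSequenceContainer * PolymorphismSequenceContainerTools::extractIngroup ( const PolymorphismSequenceContainer &  psc ) throw (Exception) [static]

Extract ingroup sequences from a PolymorphismSequenceContainer and create a new one.

Parameters:
psc  a PolymorphismSequenceContainer reference
Exceptions:
Exception  if there is no ingroup sequence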

Definition at line 103 of file PolymorphismSequenceContainerTools.cpp.

References bpp::PolymorphismSequenceContainer::deleteSequence().

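PolymorphismSequenceContainer * PolymorphismSequenceContainerTools::extractOutgroup ( const PolymorphismSequenceContainer &  psc ) throw (Exception) [static]

Extract outgroup sequences from a PolymorphismSequenceContainer and create a new one.

Parameters:
psc  a PolymorphismSequenceContainer reference
Exceptions:
Exception  if there is no outgroup sequence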

Definition at line 126 of file PolymorphismSequenceContainerTools.cpp.

References bpp::PolymorphismSequenceContainer::deleteSequence().

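size_t PolymorphismSequenceContainerTools::getNumberOfCompleteSites ( const PolymorphismSequenceContainer &  psc,
bool  ingroup 
) throw (Exception) [static]

Return the number of completely resolved sites in a PolymorphismSequenceContainer.

Parameters:
psc      a PolymorphismSequenceContainer reference
ingroup  a boolean set to true to take only ingroup sequences into account
Exceptions:
Exception  if there is no ingroup sequence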

Definition at line 263 of file PolymorphismSequenceContainerTools.cpp.

References bpp::SimpleSiteContainerIterator::hasMoreSites(), bpp::SiteTools::isComplete(), and bpp::SimpleSiteContainerIterator::nextSite().

size_t PolymorphismSequenceContainerTools::getNumberOfNonGapSites ( const PolymorphismSequenceContainer &  psc,
bool  ingroup 
) throw (Exception) [static]

Return the number of sites without gaps in a PolymorphismSequenceContainer.

Parameters:
psc      a PolymorphismSequenceContainer reference
ingroup  a boolean set to true to take only ingroup sequences into account
Exceptions:
Exception  if there is no ingroup sequence

Definition at line 233 of file PolymorphismSequenceContainerTools.cpp.

References bpp::SiteTools::hasGap(), bpp::SimpleSiteContainerIterator::hasMoreSites(), and bpp::SimpleSiteContainerIterator::nextSite().
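
The difference between a site without gaps and a completely resolved site, and the effect of the ingroup flag, can be sketched as follows. This is a standalone illustration, not the library code: AlignedSequence, countSites, countNonGapSites and countCompleteSites are hypothetical names, '-' is assumed to be the gap character, and 'N' and '?' are assumed to be the unresolved characters.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical stand-in for one aligned sequence and its ingroup flag.
struct AlignedSequence {
  std::string residues;
  bool ingroup;
};

// Count the columns in which every considered residue satisfies `keep`.
// With ingroupOnly set, outgroup rows are ignored.
template <typename Predicate>
std::size_t countSites(const std::vector<AlignedSequence>& aln,
                       bool ingroupOnly, Predicate keep) {
  std::vector<const std::string*> rows;
  for (const AlignedSequence& s : aln) {
    if (!ingroupOnly || s.ingroup) rows.push_back(&s.residues);
  }
  if (rows.empty()) throw std::runtime_error("no sequence to consider");
  const std::size_t nSites = rows.front()->size();
  std::size_t count = 0;
  for (std::size_t site = 0; site < nSites; ++site) {
    bool ok = true;
    for (const std::string* row : rows) {
      assert(row->size() == nSites);  // all rows must have the same length
      if (!keep((*row)[site])) { ok = false; break; }
    }
    if (ok) ++count;
  }
  return count;
}

// A site counts when no considered sequence has a gap there.
std::size_t countNonGapSites(const std::vector<AlignedSequence>& aln,
                             bool ingroupOnly) {
  return countSites(aln, ingroupOnly, [](char c) { return c != '-'; });
}

// A site counts when no considered sequence has a gap or an unresolved
// character there (assumed here to be 'N' or '?').
std::size_t countCompleteSites(const std::vector<AlignedSequence>& aln,
                               bool ingroupOnly) {
  return countSites(aln, ingroupOnly,
                    [](char c) { return c != '-' && c != 'N' && c != '?'; });
}
```

For an alignment such as ACNG- and ACTGT (ingroup) plus A-TGT (outgroup), restricting the count to the ingroup keeps the second column, whose only gap is in the outgroup.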

PolymorphismSequenceContainer * PolymorphismSequenceContainerTools::getOnePosition ( const PolymorphismSequenceContainer &  psc,
const std::string &  setName,
size_t  pos 
) [static]

Retrieve sites at one codon position (1, 2 or 3).

Be careful: call this method before excluding gaps.
Be careful: if there is no phase information, the method catches the exception and sets the phase to 1. This allows the method to be used on a PolymorphismSequenceContainer generated by getSelectedSequences.

Parameters:
psc      a PolymorphismSequenceContainer reference
setName  name of the CDS site selection
pos      position index

Definition at line 392 of file PolymorphismSequenceContainerTools.cpp.

References bpp::VectorSiteContainer::getGeneralComments(), bpp::PolymorphismSequenceContainer::getGroupId(), bpp::VectorSiteContainer::getNumberOfSequences(), bpp::VectorSiteContainer::getNumberOfSites(), bpp::MaseTools::getPhase(), bpp::SiteContainerTools::getSelectedSites(), bpp::PolymorphismSequenceContainer::isIngroupMember(), bpp::PolymorphismSequenceContainer::setAsIngroupMember(), bpp::PolymorphismSequenceContainer::setAsOutgroupMember(), and bpp::PolymorphismSequenceContainer::setGroupId().
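
How a phase and a codon position select sites can be sketched in a few lines. This is an illustration under stated assumptions, not the library code: codonPositionSites and selectSites are hypothetical helpers, and the phase is assumed to be the codon position (1, 2 or 3) of the first site of the selection, with 1 as the fallback when no phase information exists.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Return the indices of the sites that fall on codon position `pos`,
// assuming the first site sits at codon position `phase` (default 1).
std::vector<std::size_t> codonPositionSites(std::size_t nSites,
                                            std::size_t pos,
                                            std::size_t phase = 1) {
  assert(pos >= 1 && pos <= 3);
  assert(phase >= 1 && phase <= 3);
  std::vector<std::size_t> indices;
  for (std::size_t i = 0; i < nSites; ++i) {
    // Codon position of site i, counted from the phase of site 0.
    if ((phase - 1 + i) % 3 + 1 == pos) indices.push_back(i);
  }
  return indices;
}

// Keep only the residues found at the given site indices.
std::string selectSites(const std::string& residues,
                        const std::vector<std::size_t>& indices) {
  std::string out;
  for (std::size_t i : indices) out.push_back(residues.at(i));
  return out;
}
```

Removing gap columns first would shift the site indices relative to the reading frame, which is why the positions have to be extracted before gaps are excluded.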

PolymorphismSequenceContainer * PolymorphismSequenceContainerTools::getSelectedSites ( const PolymorphismSequenceContainer &  psc,
const std::string &  setName,
bool  phase 
) [static]

Get a PolymorphismSequenceContainer corresponding to a site selection annotated in the mase comments.

Be careful: the mase comments are lost in the new PolymorphismSequenceContainer, so information about CDS positions and the start codon is no longer available.

Parameters:
psc      a PolymorphismSequenceContainer
setName  the name of the set to retrieve
phase    a boolean set to true to take the phase into account during the extraction; the useless sites are then removed

Definition at line 335 of file PolymorphismSequenceContainerTools.cpp.

References bpp::VectorSiteContainer::deleteGeneralComments(), bpp::SiteContainer::deleteSite(), bpp::VectorSiteContainer::getGeneralComments(), bpp::PolymorphismSequenceContainer::getGroupId(), bpp::VectorSiteContainer::getNumberOfSequences(), bpp::MaseTools::getPhase(), bpp::MaseTools::getSelectedSites(), bpp::PolymorphismSequenceContainer::isIngroupMember(), bpp::PolymorphismSequenceContainer::setAsIngroupMember(), bpp::PolymorphismSequenceContainer::setAsOutgroupMember(), and bpp::PolymorphismSequenceContainer::setGroupId().

PolymorphismSequenceContainer * PolymorphismSequenceContainerTools::read ( const std::string &  path,
const Alphabet *  alpha 
) throw (Exception) [static]

Read a Mase+ file and return a PolymorphismSequenceContainer. A sequence is flagged as outgroup when its selection tag begins with OUTGROUP (see Polymorphix).

Parameters:
path   path to the Mase+ file
alpha  sequence alphabet
Exceptions:
Exception  if the file is not in the specified format

Definition at line 50 of file PolymorphismSequenceContainerTools.cpp.

References bpp::MaseTools::getAvailableSequenceSelections(), bpp::OrderedSequenceContainer::getGeneralComments(), bpp::MaseTools::getSequenceSet(), bpp::Mase::readSequences(), and bpp::PolymorphismSequenceContainer::setAsOutgroupMember().

PolymorphismSequenceContainer * PolymorphismSequenceContainerTools::sample ( const PolymorphismSequenceContainer &  psc,
size_t  n,
bool  replace = true 
) [static]

Get a random set of sequences.

Parameters:
psc      a PolymorphismSequenceContainer reference
n        the number of sequences to get
replace  a boolean flag, true for sampling with replacement

Definition at line 192 of file PolymorphismSequenceContainerTools.cpp.

References bpp::VectorSiteContainer::getNumberOfSequences(), bpp::RandomTools::getSample(), and getSelectedSequences().
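
The effect of the replace flag can be sketched with the standard library. This is an illustration only: sampleIndices is a hypothetical helper that draws sequence indices, and it does not reproduce how bpp::RandomTools::getSample works internally.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <numeric>
#include <random>
#include <stdexcept>
#include <vector>

// Draw n indices in [0, total). With replace set, an index may be drawn
// several times; otherwise each index appears at most once.
std::vector<std::size_t> sampleIndices(std::size_t total, std::size_t n,
                                       bool replace, std::mt19937& rng) {
  assert(total > 0);  // there must be something to sample from
  std::vector<std::size_t> picked;
  if (replace) {
    std::uniform_int_distribution<std::size_t> dist(0, total - 1);
    for (std::size_t i = 0; i < n; ++i) picked.push_back(dist(rng));
  } else {
    if (n > total) {
      throw std::invalid_argument("cannot draw more than total without replacement");
    }
    std::vector<std::size_t> all(total);
    std::iota(all.begin(), all.end(), std::size_t{0});
    std::shuffle(all.begin(), all.end(), rng);
    picked.assign(all.begin(), all.begin() + static_cast<std::ptrdiff_t>(n));
  }
  return picked;
}
```

With replacement, n may exceed the number of sequences, which is the usual setting for bootstrap-style resampling; without replacement, the sketch rejects such a request.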


The documentation for this class was generated from the following files:
PolymorphismSequenceContainerTools.h
PolymorphismSequenceContainerTools.cpp