NCBI C++ ToolKit
Functions
win_mask_dup_table.hpp File Reference
#include <string>
#include <vector>
#include <set>
#include <objmgr/object_manager.hpp>
#include <objmgr/scope.hpp>
#include <objmgr/seq_entry_handle.hpp>
#include <objmgr/bioseq_ci.hpp>
#include <objmgr/seq_vector.hpp>
#include <objmgr/util/sequence.hpp>
#include <algo/winmask/win_mask_util.hpp>
+ Include dependency graph for win_mask_dup_table.hpp:
+ This graph shows which files directly or indirectly include this file:

Go to the source code of this file.

Go to the SVN repository for this file.

Functions

void CheckDuplicates (const vector< string > &input, const string &infmt, const CWinMaskUtil::CIdSet *ids, const CWinMaskUtil::CIdSet *exclude_ids)
 Check for possibly duplicate sequences in the input. More...
 

Function Documentation

◆ CheckDuplicates()

void CheckDuplicates ( const vector< string > &  input,
const string infmt,
const CWinMaskUtil::CIdSet ids,
const CWinMaskUtil::CIdSet exclude_ids 
)

Check for possibly duplicate sequences in the input.

input contains the list of input file names. The files should be in the fasta format. The function checks the input sequences for duplication and reports possible duplicates to the standard error.

Parameters
inputlist of input file names
infmtinput format
idsset of ids to check
exclude_idsset of ids to ignore

Definition at line 535 of file win_mask_dup_table.cpp.

References dup_lookup_table::sample::begin(), CWinMaskUtil::consider(), data, CBioseq_Handle::eCoding_Iupac, eGetId_Best, dup_lookup_table::sample::end(), CBioseq_Handle::GetBioseqLength(), GetId(), CObjectManager::GetInstance(), CSeq_id::GetLabel(), CBioseq_Handle::GetSeqVector(), i, input(), MIN_SEQ_LENGTH, om, SAMPLE_LENGTH, SAMPLE_SKIP, and table.

Referenced by CWinMaskCountsGenerator::operator()().

Modified on Thu Apr 25 08:20:53 2024 by modify_doxy.py rev. 669887