<?xml version="1.0" encoding="ISO-8859-1"?><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
<front>
<journal-meta>
<journal-id>1684-1859</journal-id>
<journal-title><![CDATA[Revista Cubana de Informática Médica]]></journal-title>
<abbrev-journal-title><![CDATA[RCIM]]></abbrev-journal-title>
<issn>1684-1859</issn>
<publisher>
<publisher-name><![CDATA[Universidad de Ciencias Médicas de La Habana]]></publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id>S1684-18592018000200002</article-id>
<title-group>
<article-title xml:lang="es"><![CDATA[MIDAS: Aplicación informática para la identificación de microsatélites exactos e inexactos en secuencias genómicas.]]></article-title>
<article-title xml:lang="en"><![CDATA[MIDAS: Computer application for the identification of exact and inaccurate microsatellites in genomic sequences]]></article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name>
<surname><![CDATA[Martínez Ortiz]]></surname>
<given-names><![CDATA[Carlos M.]]></given-names>
</name>
<xref ref-type="aff" rid="Aff"/>
</contrib>
</contrib-group>
<aff id="Af1">
<institution><![CDATA[,Universidad de Ciencias Médicas ICPB &#8220;Victoria de Girón&#8221; Departamento de Bioquímica]]></institution>
<addr-line><![CDATA[La Habana ]]></addr-line>
<country>Cuba</country>
</aff>
<pub-date pub-type="pub">
<day>00</day>
<month>12</month>
<year>2018</year>
</pub-date>
<pub-date pub-type="epub">
<day>00</day>
<month>12</month>
<year>2018</year>
</pub-date>
<volume>10</volume>
<numero>2</numero>
<copyright-statement/>
<copyright-year/>
<self-uri xlink:href="http://scielo.sld.cu/scielo.php?script=sci_arttext&amp;pid=S1684-18592018000200002&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://scielo.sld.cu/scielo.php?script=sci_abstract&amp;pid=S1684-18592018000200002&amp;lng=en&amp;nrm=iso"></self-uri><self-uri xlink:href="http://scielo.sld.cu/scielo.php?script=sci_pdf&amp;pid=S1684-18592018000200002&amp;lng=en&amp;nrm=iso"></self-uri><abstract abstract-type="short" xml:lang="es"><p><![CDATA[RESUMEN Los microsatélites son secuencias cortas repetidas en tándem, frecuentes y diversas en los genomas de todas las especies, constituyendo importantes marcadores en múltiples áreas de investigación basadas en la genómica. Se han encontrado asociaciones de estos marcadores a un número importante de enfermedades en humanos. En el desarrollo de vacunas se ha demostrado cómo los patógenos pueden evadir la respuesta inmune simplemente alterando la composición de las secuencias repetidas en sus genes. Existen numerosas aplicaciones informáticas destinadas a la detección de estas secuencias, no obstante, éstas no cubren todas las expectativas debido a la divergencia de criterios y enfoques aplicados a la solución del problema de su detección. MIDAS implementa una solución no heurística basada en dos algoritmos combinatorios en serie: el primero detecta microsatélites exactos, y el segundo, de permitirlo los parámetros del modelo, extiende las secuencias a su versión inexacta óptima. La aplicación tiene como entrada la secuencia genómica en formato GBFF o FASTA y su salida brinda las posiciones de los microsatélites en la secuencia genómica, así como tamaños, alineamientos, flancos, posiciones, etc. El algoritmo tiene una elevada eficiencia y es exhaustivo, detectando todas las posibles secuencias repetidas independientemente de su composición nucleotídica.]]></p></abstract>
<abstract abstract-type="short" xml:lang="en"><p><![CDATA[ABSTRACT Microsatellites are tandem repeat, frequent and diverse short sequences in the genomes of all species, constituting important markers in multiple areas of genomics-based research. Associations of these markers have been found in a significant number of human diseases. Vaccine development has shown how pathogens can evade the immune response by simply altering the composition of repeat sequences in their genes. There are numerous computer applications for the detection of these sequences, but they do not meet all expectations due to the divergence of criteria and approaches applied to solving the problem of their detection. MIDAS implements a non-heuristic solution based on two combinatorial algorithms in series: the first one detects exact microsatellites, and the second one, if the model parameters allow it, extends the sequences to their optimal inaccurate version. The application has as input the genomic sequence in GBFF or FASTA format and its output provides the microsatellite positions in the genomic sequence, as well as sizes, alignments, flanks and other statistics. The algorithm is highly efficient and comprehensive, detecting all possible repeat sequences regardless of their nucleotide composition.]]></p></abstract>
<kwd-group>
<kwd lng="es"><![CDATA[SSR]]></kwd>
<kwd lng="es"><![CDATA[marcador molecular]]></kwd>
<kwd lng="es"><![CDATA[microsatélite]]></kwd>
<kwd lng="es"><![CDATA[minería de datos]]></kwd>
<kwd lng="es"><![CDATA[algoritmo]]></kwd>
<kwd lng="en"><![CDATA[SSR]]></kwd>
<kwd lng="en"><![CDATA[microsatellite]]></kwd>
<kwd lng="en"><![CDATA[molecular marker]]></kwd>
<kwd lng="en"><![CDATA[data mining]]></kwd>
<kwd lng="en"><![CDATA[algorithms]]></kwd>
</kwd-group>
</article-meta>
</front><back>
<ref-list>
<ref id="B1">
<label>1</label><nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Liang]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Watanabe]]></surname>
<given-names><![CDATA[H]]></given-names>
</name>
<name>
<surname><![CDATA[Terajima]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Li]]></surname>
<given-names><![CDATA[C]]></given-names>
</name>
<name>
<surname><![CDATA[Liao]]></surname>
<given-names><![CDATA[J]]></given-names>
</name>
<name>
<surname><![CDATA[Tung]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Chiou]]></surname>
<given-names><![CDATA[C.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Multilocus Variable-Number Tandem-Repeat Analysis for Molecular Typing of Shigella sonnei]]></article-title>
<source><![CDATA[JOURNAL OF CLINICAL MICROBIOLOGY]]></source>
<year>2007</year>
<volume>45</volume>
<numero>11</numero>
<issue>11</issue>
<page-range>3574-80</page-range></nlm-citation>
</ref>
<ref id="B2">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Martin]]></surname>
<given-names><![CDATA[P]]></given-names>
</name>
<name>
<surname><![CDATA[Makepeace]]></surname>
<given-names><![CDATA[K]]></given-names>
</name>
<name>
<surname><![CDATA[Hill]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Hood]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
<name>
<surname><![CDATA[Moxon]]></surname>
<given-names><![CDATA[E.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Microsat instability regulates transcription factor binding and gene expression]]></article-title>
<source><![CDATA[Proc Natl Acad Sci USA]]></source>
<year>2005</year>
<volume>102</volume>
<numero>10</numero>
<issue>10</issue>
<page-range>3800-4</page-range></nlm-citation>
</ref>
<ref id="B3">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mitas]]></surname>
<given-names><![CDATA[M.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Trinucleotide repeats associated with human disease]]></article-title>
<source><![CDATA[Nucleic Acids Res]]></source>
<year>1997</year>
<volume>25</volume>
<numero>12</numero>
<issue>12</issue>
<page-range>2245-54</page-range></nlm-citation>
</ref>
<ref id="B4">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Arzimanoglou]]></surname>
<given-names><![CDATA[I]]></given-names>
</name>
<name>
<surname><![CDATA[Gilbert]]></surname>
<given-names><![CDATA[F]]></given-names>
</name>
<name>
<surname><![CDATA[Barber]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Microsatellite instability in human solid tumors]]></article-title>
<source><![CDATA[Cancer]]></source>
<year>1998</year>
<volume>82</volume>
<numero>10</numero>
<issue>10</issue>
<page-range>1808-20</page-range></nlm-citation>
</ref>
<ref id="B5">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Moxon]]></surname>
<given-names><![CDATA[ER]]></given-names>
</name>
<name>
<surname><![CDATA[Rainey]]></surname>
<given-names><![CDATA[PB]]></given-names>
</name>
<name>
<surname><![CDATA[Nowak]]></surname>
<given-names><![CDATA[MA]]></given-names>
</name>
<name>
<surname><![CDATA[Lenski]]></surname>
<given-names><![CDATA[RE.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Adaptive evolution of highly mutable loci in pathogenic bacteria]]></article-title>
<source><![CDATA[Current Biology]]></source>
<year>1994</year>
<numero>4</numero>
<issue>4</issue>
<page-range>24-33</page-range></nlm-citation>
</ref>
<ref id="B6">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Moxon]]></surname>
<given-names><![CDATA[R]]></given-names>
</name>
<name>
<surname><![CDATA[Bayliss]]></surname>
<given-names><![CDATA[C]]></given-names>
</name>
<name>
<surname><![CDATA[Hood]]></surname>
<given-names><![CDATA[D.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Bacterial contingency loci: the role of simple sequence DNA repeats in bacterial adaptation]]></article-title>
<source><![CDATA[Annu. Rev.Genet]]></source>
<year>2006</year>
<volume>40</volume>
<page-range>307-33</page-range></nlm-citation>
</ref>
<ref id="B7">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Bayliss]]></surname>
<given-names><![CDATA[CD]]></given-names>
</name>
<name>
<surname><![CDATA[Field]]></surname>
<given-names><![CDATA[D]]></given-names>
</name>
<name>
<surname><![CDATA[Moxon]]></surname>
<given-names><![CDATA[ER.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[The simple sequence contingency loci of Haemophilus influenzae and Neisseria meningitidis]]></article-title>
<source><![CDATA[J. Clin. Invest]]></source>
<year>2007</year>
<volume>107</volume>
<page-range>657-62</page-range></nlm-citation>
</ref>
<ref id="B8">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Grover]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<name>
<surname><![CDATA[Aishwarya]]></surname>
<given-names><![CDATA[V]]></given-names>
</name>
<name>
<surname><![CDATA[Sharma]]></surname>
<given-names><![CDATA[PC.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Searching microsatellites in DNA sequences: approaches used and tools developed]]></article-title>
<source><![CDATA[Physiol Mol Biol Plants]]></source>
<year>2012</year>
<volume>18</volume>
<numero>1</numero>
<issue>1</issue>
<page-range>11-9</page-range></nlm-citation>
</ref>
<ref id="B9">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Leclercq]]></surname>
<given-names><![CDATA[S]]></given-names>
</name>
<name>
<surname><![CDATA[Rivals]]></surname>
<given-names><![CDATA[E]]></given-names>
</name>
<name>
<surname><![CDATA[Jarne]]></surname>
<given-names><![CDATA[P.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Detecting microsatellites within genomes: significant variation among algorithms]]></article-title>
<source><![CDATA[BMC Bioinformatics]]></source>
<year>2007</year>
<volume>8</volume>
<page-range>125</page-range></nlm-citation>
</ref>
<ref id="B10">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Benson]]></surname>
<given-names><![CDATA[G.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Tandem repeats finder: a program to analyze DNA sequences]]></article-title>
<source><![CDATA[Nucleic Acids Res]]></source>
<year>1999</year>
<volume>27</volume>
<page-range>573-80</page-range></nlm-citation>
</ref>
<ref id="B11">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Mudunuri]]></surname>
<given-names><![CDATA[SB]]></given-names>
</name>
<name>
<surname><![CDATA[Nagarajaram]]></surname>
<given-names><![CDATA[HA]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[IMEx: Imperfect Microsatellite Extracto]]></article-title>
<source><![CDATA[Bioinformatics]]></source>
<year>2007</year>
<volume>23</volume>
<page-range>1181-7</page-range></nlm-citation>
</ref>
<ref id="B12">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Merkel]]></surname>
<given-names><![CDATA[A]]></given-names>
</name>
<name>
<surname><![CDATA[Gemmell]]></surname>
<given-names><![CDATA[N.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Detecting short tandem repeats from genome data: opening the software black box]]></article-title>
<source><![CDATA[Brief Bioinform]]></source>
<year>2008</year>
<volume>9</volume>
<page-range>355-66</page-range></nlm-citation>
</ref>
<ref id="B13">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Zhou]]></surname>
<given-names><![CDATA[H]]></given-names>
</name>
<name>
<surname><![CDATA[Du]]></surname>
<given-names><![CDATA[L]]></given-names>
</name>
<name>
<surname><![CDATA[Yan]]></surname>
<given-names><![CDATA[H.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[Detection of tandem repeats in DNA sequences based on parametric spectral estimation]]></article-title>
<source><![CDATA[IEEE Trans Inf Technol Biomed]]></source>
<year>2009</year>
<volume>13</volume>
<page-range>747-55</page-range></nlm-citation>
</ref>
<ref id="B14">
<nlm-citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname><![CDATA[Castelo]]></surname>
<given-names><![CDATA[AT]]></given-names>
</name>
<name>
<surname><![CDATA[Martins]]></surname>
<given-names><![CDATA[W]]></given-names>
</name>
<name>
<surname><![CDATA[Gao]]></surname>
<given-names><![CDATA[GR.]]></given-names>
</name>
</person-group>
<article-title xml:lang=""><![CDATA[TROLL: Tandem repeats occurrence locator]]></article-title>
<source><![CDATA[Bioinformatics]]></source>
<year>2002</year>
<volume>18</volume>
<page-range>634-6</page-range></nlm-citation>
</ref>
</ref-list>
</back>
</article>
