UNIVERSIDAD NACIONAL AUTÓNOMA DE MÉXICO
POSGRADO EN CIENCIAS BIOLÓGICAS
INSTITUTO DE ECOLOGÍA
EVOLUTIONARY BIOLOGY

Phylogeography of Boa constrictor (Serpentes: Boidae) in Mexico and Central America

THESIS
SUBMITTED FOR THE DEGREE OF:
DOCTOR OF SCIENCE

PRESENTED BY:
Marco Alejandro Suárez Atilano

PRINCIPAL THESIS ADVISOR: DRA. ELLA VÁZQUEZ DOMÍNGUEZ
INSTITUTO DE ECOLOGÍA
ADVISORY COMMITTEE
DR. DANIEL IGNACIO PIÑERO DALMAU
INSTITUTO DE ECOLOGÍA
DRA. GABRIELA PARRA OLEA
INSTITUTO DE BIOLOGÍA

MÉXICO, D.F., JUNE 2015

UNAM – Dirección General de Bibliotecas
Tesis Digitales
Restrictions on use

ALL RIGHTS RESERVED ©
TOTAL OR PARTIAL REPRODUCTION PROHIBITED

All material contained in this thesis is protected by the Federal Copyright Law
(Ley Federal del Derecho de Autor, LFDA) of the United Mexican States (Mexico).
The use of images, video fragments and other material subject to copyright
protection shall be exclusively for educational and informational purposes and
must cite the source from which it was obtained, naming the author or authors.
Any other use, such as for profit, reproduction, editing or modification, will be
prosecuted and sanctioned by the respective copyright holder.

UNAM · POSGRADO
Ciencias Biológicas
COORDINACIÓN

Dr. Isidro Ávila Martínez
Director General de Administración Escolar, UNAM
Presente

I hereby inform you that the Subcommittee for Evolutionary Biology and Systematics, in its
ordinary session of 23 March 2015, approved the jury for the examination for the degree of
DOCTOR EN CIENCIAS of the Posgrado en Ciencias Biológicas of the student SUÁREZ ATILANO
MARCO ALEJANDRO, account number 302142558, with the thesis entitled "FILOGEOGRAFÍA DE BOA
CONSTRICTOR (Serpentes: Boidae) EN MÉXICO Y CENTRO AMÉRICA", under the supervision of
DRA. ELLA GLORIA VÁZQUEZ DOMÍNGUEZ:

President: DR. ADRIÁN NIETO MONTES DE OCA
Member (Vocal): DR. JOSÉ JAIME ZÚÑIGA VEGA
Secretary: DRA. GABRIELA PARRA OLEA
Alternate: DR. OCTAVIO RAFAEL ROJAS SOTO
Alternate: DR. FAUSTO ROBERTO MÉNDEZ DE LA CRUZ

With nothing further to add, I send you my cordial regards.

SINCERELY
"POR MI RAZA HABLARÁ EL ESPÍRITU"
Cd. Universitaria, D.F., 2 June 2015

DRA. [name illegible]
COORDINATOR OF THE PROGRAM

c.c.p. File of the interested party

Unidad de Posgrado · Coordinación del Posgrado en Ciencias Biológicas, Edificio D, 1er. Piso,
Circuito de Posgrados, Cd. Universitaria, Delegación Coyoacán, C.P. 04510, México, D.F.
Tel. 5623 1002 http://pcbiol.posgrado.unam.mx

Institutional Acknowledgments

To the Posgrado en Ciencias Biológicas of the Universidad Nacional Autónoma de México, for
accepting me into the doctoral program in Biological Sciences and for the support provided
throughout my studies and this thesis.
To the Consejo Nacional de Ciencia y Tecnología for the doctoral scholarship awarded for my
graduate studies (CONACyT-346511). To the Beca Mixta program (CONACyT) and PAEP-UNAM for
the support granted for several research stays and for presenting the results of this thesis
at conferences abroad.
To projects CONACyT-101881 and PAPIIT-IN217910 for funding the field and laboratory work.
To Dr. Ella Vázquez Domínguez, who supervised this thesis and was my principal advisor
during my graduate studies.
To Dr. Gabriela Parra Olea and Dr. Daniel Piñero Dalmau, members of my advisory committee,
for their constant guidance and invaluable comments during my doctoral thesis.

Personal Acknowledgments

To my parents, Yolanda and Alejandro, who have always helped and unconditionally supported
their little one in every crazy idea he has come up with, one of them being the
doctorate…
To my cousin, almost a sister, Marisol, who has always looked out for me and who, alongside
my parents, worried about me during field trips, research stays, conferences and so on.
(On account of natural phenomena…)
To Ella, for believing in me enough to let me “skip the Master's” and for placing in my
hands this project that has meant so much to me. I am grateful not only for these 5 years of
the doctorate, but for what are now 8 years of boas and of a great deal of learning, not
only academic but personal as well.
To the examiners, members of the jury for my examination: Dr. Adrián Nieto Montes de Oca,
Dr. Jaime Zúñiga Vega, Dr. Gabriela Parra Olea, Dr. Octavio Rafael Rojas Soto and Dr. Fausto
Méndez de la Cruz, for their corrections and observations on this work.
To Frank T. Burbrink, PhD, of the College of Staten Island, City University of New York, for
his guidance and the facilities provided during the academic stay I carried out in his
laboratory.
To Dr. Octavio Rafael Rojas Soto and Dr. Enrique Martínez Meyer for their guidance in
various complementary academic activities carried out during my doctoral studies.
To “mis mantas” Susette (Madame Suse) and Tania (La Chidis), you pair of psychopaths with
whom I have been through so much everywhere: thank you for putting up with each other all
these years.
To “la banda del labo GenyEcol”, with whom I have had countless adventures inside and
outside the laboratory: Alex, Mariana, Nancy, Azalea, Coni, JC and Erick.
To “my gang from the Faculty”: Flower, Eunisss, Montu, Jelipe and Sophie, with whom I have
shared the good times, the bad times and the worst.
To “my gang from high school”: Francisco, Diana and Armando; much has changed among us in
these nearly 15 years, but you have stayed with me all this time

INDEX

Contents

Summary

Abstract

General Introduction

Chapter 1: Microsatellites: advantages and drawbacks in phylogeographic
analyses

Chapter 2: Phylogeographical structure within Boa constrictor imperator
across the lowlands and mountains of Central America and Mexico

Chapter 3: The environment's role on the genetic divergence between two
Boa constrictor lineages from Mexico and Central America

Final discussion and conclusions

References

Summary
Phylogeography aims to understand how historical processes at different scales have affected
the genetic diversity present in natural populations. In addition, the integration of
environmental data through Ecological Niche Modelling has allowed the analysis of the
evolution of the ecological niche and its influence on patterns of genetic differentiation,
from the population to the species level. The complex geological and climatic history of
Mexico and Central America makes this region an ideal study area for analysing different
processes at the ecological and evolutionary levels. The main objective of our work was to
analyse the phylogeographic structure and the ecological and geographic distribution of Boa
constrictor imperator in Mexico and Central America, in order to identify the historical
processes that shaped its current diversity and distribution. With a sample of 145
individuals and different types of molecular markers (cytochrome b, an ODC intron and
microsatellites), we implemented analyses of phylogenetic inference, spatial genetics,
simulation-based models, Ecological Niche Modelling, niche interpredictivity, similarity and
equivalency, as well as Mantel tests. The genetic analyses identified two reciprocally
monophyletic lineages for cytochrome b, one distributed along the Mexican Pacific coast and
the other in the Gulf of Mexico region, the Yucatán Peninsula and Central America, with an
estimated divergence of 5.2 million years. Both lineages show internal subdivision into
haplogroups, which exhibit sustained demographic growth and a recent population expansion.
High levels of genetic diversity were observed for both cytochrome b (h = 0.944) and
microsatellites (HNei = 0.810–0.900). In turn, predictions of both the geographic and the
environmental distribution based on the two main lineages indicated high levels of
interprediction, as well as evidence of ecological niche differentiation between them. Based
on the microsatellites, patterns of isolation by ecological and geographic (Euclidean)
distance were detected. These results agree with those found in other organisms of
Neotropical origin that also dispersed across the Isthmus of Panama. Our results are highly
relevant for the conservation of the boa, both because its taxonomic relationships,
particularly with respect to the South American subspecies, remain unresolved, and because
of its status as a species protected nationally and internationally.

Keywords: mitochondrial DNA, boa, Great American Biotic Interchange, microsatellites,
Ecological niche modelling, Neotropics, reptiles

Abstract
Phylogeography seeks to understand how population and historical processes influence
genetic diversity in extant populations. Furthermore, the integration of environmental data
via Ecological Niche Modelling has allowed analysis of how niche evolution can
influence patterns of genetic differentiation, from the population to the species level. The
complex geological and climatic history of the region encompassing Mexico and Central America
makes it an ideal setting for studies of evolutionary and ecological processes. Our main
objective was to analyse the phylogeographic structure and the ecological and
geographic distribution of Boa constrictor imperator in Mexico and Central America, with the
aim of identifying the key historical events responsible for its current distribution and
diversity. Based on 145 individuals, we combined molecular data from mitochondrial cytochrome
b (cyt b) sequences, nuclear ODC intron sequences and microsatellites, implementing analyses
of phylogenetic inference, spatial genetics, model-driven methods, Ecological Niche
Modelling, interprediction, niche equivalency and similarity, and Mantel tests. The cyt b
results showed two main reciprocally monophyletic lineages, one along the Mexican Pacific
coast and another along the Gulf of Mexico, Yucatan Peninsula and Central America, which
diverged c. 5.2 million years ago. Both lineages are further subdivided into haplogroups,
which show steady historical growth and a more recent population expansion. High genetic
diversity was observed for both cyt b (h = 0.944) and microsatellites (HNei = 0.810–0.900).
In addition, environmental and geographic prediction analyses based on the two mitochondrial
lineages indicated high levels of asymmetrical interprediction and evidence of niche
differentiation. Based on the microsatellite data (genetic distances), isolation-by-distance
patterns were observed in relation to ecological and geographic (Euclidean) distances. These
results agree with the phylogeographic histories of other Neotropical organisms that also
dispersed across the Isthmus of Panama. Our results are relevant for the boa’s conservation,
given its unresolved taxonomic status with respect to all South American subspecies and its
conservation status both nationally and internationally.

Key words: boa, Ecological niche modelling, Great American Biotic Interchange, microsatellites,
mitochondrial DNA, Neotropics, reptiles

GENERAL INTRODUCTION

Phylogeography

Biological diversity can be conceptualised as a continuum if we consider that the variety of
species, the genetic variation present within each one, and the differences among the
ecosystems where they occur are all products of the evolutionary process understood at
different temporal and spatial scales (Ramanatha Rao and Hodgkin 2002). In addition, because
ecological and evolutionary processes are inseparable from their geographic context, the
different levels of biological diversity usually show some degree of spatial association or
structure (Lomolino & Heaney 2004, Lomolino et al. 2005, Diniz-Filho et al. 2008). It is
therefore unthinkable to analyse these processes outside a geographic context.
Traditionally, the analysis of this kind of association between biota and geography has
focused on species or higher taxonomic categories (macroevolutionary processes), mainly
under the approach of historical biogeography or palaeobiology. Population genetics, in
contrast, whose aim is to study the evolutionary processes occurring within populations
(microevolutionary processes), emphasises that the distribution of genetic variation within
a species results from the combination of different forces at the molecular and population
levels, including mutation, genetic drift, gene flow, selection and demographic history
(Wang 2010).
These two fields, traditionally separated both by their origins and by their theoretical and
methodological approaches, have been synthesised in the discipline that was named
phylogeography during the 1980s, which makes it possible to elucidate the ecological and
evolutionary processes associated with the geographic distribution of natural populations
through the analysis of their gene lineages (Avise 1987, 2000). Because one of the primary
aims of phylogeography is to serve as a link between population genetics and phylogenetic
systematics, it also considers the presence of hybrid zones and the delimitation of species
(Avise et al. 1987, Hewitt 2001). Consequently, the use of molecular markers with different
rates of evolution and modes of inheritance is invaluable for the analysis and inference of
different ecological and evolutionary processes (Vázquez-Domínguez et al. 2009).
The use of different types of molecular markers (e.g. extranuclear vs. nuclear), together
with a wide variety of statistical methods, has allowed phylogeography to have an important
impact on conservation biology, not only in cases related to species delimitation but also
in its application to assessing levels of infraspecific genetic diversity under the concept
of Evolutionarily Significant Units (ESUs, Moritz 2002). Genetic information thus offers a
tool of great importance for delimiting such units and for providing a context in which to
develop strategies and set conservation priorities (Pertoldi et al. 2007,
Domínguez-Domínguez & Vázquez-Domínguez 2009).

Phylogeographic methods

Phylogeography today rests on a well-defined body of theoretical foundations and analytical
methods, some shared with other disciplines and many others developed specifically during
the 25 years in which the discipline has grown and consolidated. These include the so-called
exploratory methods (Knowles 2009, Garrick et al. 2010), which are based on phylogenetic
analysis at the intraspecific level and whose main feature is that they allow a preliminary
analysis of the occurrence of historical events without the need to test an a priori
hypothesis (Garrick et al. 2010, Zink et al. 2010). Among their main advantages, they can be
applied to different study systems regardless of marker type or sample size, rely on few
prior assumptions and are undemanding in terms of computing time. Although the qualitative
approach of these analyses rests on simply superimposing the resulting trees or networks on
the geographic distribution of the populations analysed, they make it easy to generate, in
a direct way, hypotheses about the events or processes that produced a given kind of
structure (Avise 1987, 2000, Vázquez-Domínguez et al. 2009).
In an attempt to extract information complementary to that obtained from analyses of genetic
lineages, phylogeography uses summary statistics (Knowles 2002) generated from neutral
models of molecular evolution (without natural selection) developed within the theoretical
framework of population genetics. More recently, simulation-based statistical phylogeography
has used coalescent theory to infer the occurrence of population processes such as
fluctuations in population size, natural selection or variation in gene flow; coalescent
theory allows the historical relationships among gene lineages to be described as a function
of demographic parameters (Fu and Li 1993a). There is, then, an intricate relationship
between demography and gene genealogy, such that demographic variables are decisive factors
governing the depth, shape and branching and coalescence processes of lineages (Fu and Li
1993b, Gillespie 2004).
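
The link between demography and genealogy depth described above can be illustrated with a
minimal sketch of the standard neutral coalescent for a single, constant-size diploid
population. This is an illustration only, not an analysis from the thesis; the function
names and the parameter values in the usage note are hypothetical. With k lineages the
waiting time to the next coalescence is exponential with mean 4N/(k(k-1)) generations, so
the expected time to the most recent common ancestor of n gene copies is 4N(1 - 1/n).

```python
import random


def simulate_tmrca(n_samples, pop_size, rng):
    """Simulate the time (in generations) back to the most recent common
    ancestor of n_samples gene copies in a diploid population of constant
    size pop_size, under the standard neutral coalescent."""
    time = 0.0
    lineages = n_samples
    while lineages > 1:
        # k(k-1)/2 pairs, each coalescing at rate 1/(2N) per generation
        rate = lineages * (lineages - 1) / (4.0 * pop_size)
        time += rng.expovariate(rate)
        lineages -= 1
    return time


def expected_tmrca(n_samples, pop_size):
    """Analytical expectation: 4N * (1 - 1/n) generations."""
    return 4.0 * pop_size * (1.0 - 1.0 / n_samples)


def mean_tmrca(n_samples, pop_size, replicates=5000, seed=1):
    """Average simulated TMRCA over independent genealogies."""
    rng = random.Random(seed)
    total = sum(simulate_tmrca(n_samples, pop_size, rng) for _ in range(replicates))
    return total / replicates
```

Calling `mean_tmrca(10, 1000)` and `mean_tmrca(10, 4000)` (hypothetical values) shows that
a larger population yields proportionally deeper genealogies, which is the sense in which
demographic variables govern the depth of gene trees.
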
In the last decade, computational advances and the development of new statistical algorithms
have fostered a new methodological approach in phylogeography, in which a phylogeographic
hypothesis postulated a priori can be tested by exploring and analysing a universe of
alternative events, generating a posteriori scenarios based on prior information obtained
from both the exploratory models and the summary statistics. This new generation of
statistics is known as simulation-based methods (Model-Driven Methods, Garrick et al. 2010).
Most implement Approximate Bayesian Computation (ABC) or Maximum Likelihood (ML) and can
analyse complex datasets with large sample sizes, thereby obtaining the joint information of
different classes of loci. It is then possible to distinguish statistically among a series
of scenarios that may combine information on different demographic processes such as
divergence, admixture, bottlenecks or migration, as well as their occurrence at different
moments in time. This makes it possible to discriminate between historical scenarios as
different as the retention of ancestral polymorphisms between two populations and isolation
with migration between two lineages (Knowles 2009, Hey 2010).
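
To make the logic of Approximate Bayesian Computation concrete, the following toy rejection
sampler may help. It is a sketch under simplifying assumptions, not the software or
scenarios used in this thesis: a single constant-size population, a uniform prior on the
scaled mutation rate theta, and the number of segregating sites S as the only summary
statistic. All function names and numerical values are hypothetical.

```python
import math
import random


def poisson(lam, rng):
    """Draw a Poisson variate with Knuth's multiplication method
    (adequate for the modest rates used in this toy example)."""
    limit = math.exp(-lam)
    count, product = 0, rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count


def simulate_segregating_sites(theta, n_samples, rng):
    """Simulate the number of segregating sites for n_samples sequences."""
    total_length = 0.0
    for k in range(n_samples, 1, -1):
        # k lineages persist for an Exp(k(k-1)/2) time, in coalescent units
        total_length += k * rng.expovariate(k * (k - 1) / 2.0)
    # mutations fall on the tree at rate theta/2 per unit of branch length
    return poisson(theta / 2.0 * total_length, rng)


def abc_rejection(observed_s, n_samples, prior_max,
                  draws=20000, tolerance=1, seed=0):
    """Keep the prior draws whose simulated summary statistic falls within
    `tolerance` of the observed one; the kept values approximate the
    posterior distribution of theta."""
    rng = random.Random(seed)
    accepted = []
    for _ in range(draws):
        theta = rng.uniform(0.0, prior_max)
        simulated = simulate_segregating_sites(theta, n_samples, rng)
        if abs(simulated - observed_s) <= tolerance:
            accepted.append(theta)
    return accepted
```

Comparing competing historical scenarios follows the same pattern: each scenario has its own
simulator, and the proportion of accepted simulations per scenario approximates its relative
support.
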
Alongside the versatility of exploratory methods based on phylograms and the capacity of
simulation-based methods to infer processes through time with great statistical precision,
another major advance in the development of phylogeography is the ability to “take geography
into account” more explicitly in order to generate new estimators and implement new methods.
The emergence of fields such as landscape ecology and landscape genetics, together with the
refinement of geographic information systems (GIS), has made it possible to work directly
with geospatial information (e.g. climate, vegetation, geology) to analyse processes at
local scales such as habitat connectivity or spatial genetic structure (Guillot et al. 2009,
Chang et al. 2010). When extrapolated to the regional or even continental geographic scales
typically used in phylogeography, these methods allow inferences about evolutionary
processes such as the presence of genetic discontinuities (Guillot et al. 2009), which can
be related to physiographic features such as rivers, mountain ranges or changes in
vegetation type that have been regarded as biogeographic barriers or used to delimit
biogeographic provinces (Chang et al. 2010).
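
One simple way of relating genetic data to geography is the Mantel test, which the abstract
lists among the analyses used. The sketch below is a generic permutation version written for
illustration, not the implementation used in the thesis. It correlates the off-diagonal
entries of two symmetric distance matrices and assesses significance by jointly permuting
the rows and columns of one of them. Function names and the example matrices are
hypothetical.

```python
import math
import random


def upper_triangle(matrix):
    """Return the entries above the diagonal of a square matrix, row by row."""
    n = len(matrix)
    return [matrix[i][j] for i in range(n) for j in range(i + 1, n)]


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def mantel_test(dist_a, dist_b, permutations=999, seed=0):
    """One-sided Mantel test: returns (r, p), where p is the proportion of
    permutations with a correlation at least as large as the observed one."""
    rng = random.Random(seed)
    n = len(dist_a)
    flat_a = upper_triangle(dist_a)
    r_obs = pearson(flat_a, upper_triangle(dist_b))
    at_least = 0
    order = list(range(n))
    for _ in range(permutations):
        rng.shuffle(order)
        # permute rows and columns together so the matrix stays a distance matrix
        permuted = [[dist_b[order[i]][order[j]] for j in range(n)] for i in range(n)]
        if pearson(flat_a, upper_triangle(permuted)) >= r_obs:
            at_least += 1
    return r_obs, (at_least + 1) / (permutations + 1)
```

For isolation by distance, `dist_a` would hold geographic (or ecological) distances between
sampling sites and `dist_b` the corresponding pairwise genetic distances.
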
One advance in particular, related to the field of biodiversity informatics, is the
application of Species Distribution Prediction Models (Ecological Niche Modelling) based on
the presence records of a species and on environmental variables (climatic, topographic,
etc.). This information is used to generate ecological niche models that, when projected
onto geography, indicate particular areas where environmental conditions are suitable for
the species in question (Elith et al. 2006, Peterson 2006). The creation of predictive
models through ecological niche modelling has influenced several aspects of phylogeographic
inference, since their use together with genetic data has made it possible, for example, to
generate hypotheses about probable refugia or centres of diversification (Jakob et al.
2010).
In addition, the combined analysis of genetic differentiation between lineages and the
identification of the bioclimatic variables contributing most to the niche model has made it
possible to identify ecological factors potentially related to diversification processes
(Raxworthy et al. 2007). Moreover, the analysis of the distributions of phylogenetically
close species, together with phylogeographic evidence, has been used as a tool to delimit
species and identify factors of speciation (Martínez-Gordillo et al. 2010,
Sánchez-Fernández et al. 2011).

Mexico and Central America as study areas

The peculiar distribution and great diversity of species present in Mexico result from a
particular set of historical factors: the great topographic and climatic variation of the
territory, a product of its complex geological history (Savage 1982), and Mexico's location
between two biogeographic regions (the Nearctic and the Neotropical), a position that gives
it a character of faunal and floral transition unique on the planet (Ochoa-Ochoa and Flores
Villela 2006). This transition zone, where the overlap of both Nearctic and Neotropical
biotas can be recognised, has been called the Mexican Transition Zone (MTZ, Halffter 1976).
The extent of this transition zone has been analysed from both panbiogeographic (Morrone and
Márquez 2001, Escalante et al. 2004, Morrone and Gutiérrez 2005, Huidobro et al. 2006,
Morrone 2010) and cladistic perspectives (Flores-Villela and Goyenechea 2001,
Contreras-Medina et al. 2007, Flores-Villela and Martínez-Salazar 2009, Miguez-Gutiérrez et
al. 2013), considering different terrestrial and freshwater taxa. Some of these analyses
indicate that the Mexican Transition Zone (MTZ) spans the territory from the southwestern
United States to the lowlands of southern Nicaragua (Halffter 1976, Marshall and Liebherr
2000, Contreras-Medina et al. 2007), whereas others restrict the transition zone to the
territory comprising the Trans-Mexican Volcanic Belt, the Sierra Madre Oriental and the
Sierra Madre Occidental (Ortega and Arita 1998, Escalante et al. 2004, Morrone 2006).
Although there is no consensus on the extent of the Mexican Transition Zone (MTZ), the
Trans-Mexican Volcanic Belt is regarded as the boundary between the Nearctic and Neotropical
regions, separating areas of Nearctic affinity to the north from those of Neotropical
affinity to the south, while the Isthmus of Tehuantepec and the Nicaraguan Depression are
considered the main vicariant events within the MTZ (Contreras-Medina et al. 2007,
Miguez-Gutiérrez et al. 2013).
Central America, in turn, should be understood in a strict sense as the continental
territory extending from the Isthmus of Tehuantepec in Mexico to the Atrato River
depression, on the border between Panama and Colombia (Bundschuh and Alvarado 2007). Its
formation and geological evolution are linked to the tectonic interaction of the North
American, Nazca, Cocos and Caribbean plates and the Panama microplate, where the oceanic
Cocos plate subducts beneath the continental North American and Caribbean plates and the
Panama block (Bundschuh and Alvarado 2007, Montes et al. 2015). Initially, subduction of the
Cocos crust caused the uplift of the northern portion of this region during the Miocene, 25
million years ago. Fifteen million years ago Central America emerged as an archipelago,
which extended towards South America over a period of 8 million years. The complete closure
of the Isthmus of Panama and the separation of the Pacific Ocean and the Caribbean Sea are
considered to have occurred 3–2.5 million years ago (Coates and Obando 1996).
Together with (1) the emergence of the Isthmus of Panama and (2) the orogenic processes in
Central America since the Miocene, other fundamental biogeographic events that have shaped
the current distribution of the region's biota are (3) the Pleistocene glacial cycles, which
brought climatic changes, shifts in the distribution of vegetation types and sea-level
fluctuations (Webb 1991, Zamudio & Greene 1997, Wüster et al. 2005, Gutiérrez-García and
Vázquez-Domínguez 2013). Given this scenario, Central America has functioned as a land
connection between two continental masses of different geological origin, North America of
Laurasian origin and South America of Gondwanan origin, and at different moments in history
it has allowed organisms to disperse between them, as in the Great American Biotic
Interchange, which took place around the Plio-Pleistocene boundary (Woodburne 2010).
Given the physiographic, geological and climatic dynamics and complexity of the Mexican
Neotropics, the Mexican Transition Zone and Central America, this area is ideal and of
immense interest for analysing how different historical processes, at different temporal and
spatial scales, have shaped biological diversity and its geographic distribution,
highlighting in particular the differences between organisms of Nearctic and of Neotropical
origin.

Boa constrictor and its intraspecific differentiation

Boa constrictor is the most widely distributed member of the snake family Boidae, ranging
from northern Mexico to northern Argentina (Boback 2005). It is a generalist predator of
terrestrial vertebrates that inhabits a broad spectrum of environments below 1,500 m a.s.l.
(Brambilia-Navarrete 2007, Álvarez-Romero et al. 2008). Traditionally, several populations
have been recognised across its range and given subspecies status, delimited mainly by their
restriction to particular geographic areas and by variation in morphological characters such
as the number of dorsal scales, the number of dorsal blotches and total length (Langhammer
1983, Price and Russo 1991, Rivera et al. 2005). Morphological (Langhammer 1983,
Casas-Andreu 1992, Santoyo-Brito 2007, Wright 2012) and molecular (Hynková et al. 2009,
Reynolds et al. 2014) analyses indicate that the subspecies B. c. imperator, present from
Mexico to northern Colombia, is recognisable as a natural group (clade) separate from its
South American counterparts; however, the genetic differentiation and phylogenetic
relationships among the populations of this subspecies are still poorly known. In addition,
Boa constrictor has reduced populations in Mexico and is listed in NOM-059-2001 as a
threatened species, mainly because of its trafficking and trade as a pet and because of
habitat loss. It is also listed in Appendix II of CITES, together with the great majority of
snakes of the family Boidae (Bertona & Chiaraviglio 2003, Álvarez-Romero et al. 2005, Bueno
et al. 2005, Brambilia-Navarrete 2007).
In this context, the present project set the following goals and objectives: (i) to analyse
genetic diversity and structure within and among populations of B. c. imperator in Mexico
and Central America, (ii) to identify the main historical and geological events in Mexico
and Central America that determined its distribution and diversification, and (iii) to
evaluate whether the divergence among its genetic lineages agrees with geological or
biogeographic events detected for other terrestrial vertebrate species with a similar
distribution. To address these questions, we sought to incorporate not just a single
molecular marker but genealogical information from different markers, differing in mode of
inheritance (mitochondrial vs. nuclear) and in the kind of phylogenetic inference they
support (sequences vs. allele frequencies), so as to identify historical processes that are
difficult to detect with a single type of molecular marker (e.g. hybridisation). This first
part is presented in chapter 2 as a scientific article published in the Journal of
Biogeography, entitled “Phylogeographical structure within Boa constrictor imperator across
the lowlands and mountains of Central America and Mexico”.
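
As an illustration of how the genetic diversity in objective (i) is typically quantified,
the sketch below computes Nei's unbiased gene diversity, the kind of statistic reported in
the abstract as h for cytochrome b haplotypes and as HNei for microsatellite alleles. The
formula is h = n/(n-1) * (1 - sum of squared frequencies), where n is the number of sampled
gene copies. The function name and the example data are hypothetical and this is not the
software used in the thesis.

```python
from collections import Counter


def gene_diversity(variants):
    """Nei's unbiased gene diversity for a list of haplotype or allele
    labels, one label per sampled gene copy."""
    n = len(variants)
    if n < 2:
        raise ValueError("at least two gene copies are required")
    counts = Counter(variants)
    # probability that two copies drawn with replacement are identical
    homozygosity = sum((c / n) ** 2 for c in counts.values())
    # n/(n-1) corrects the estimate for finite sample size
    return n / (n - 1) * (1.0 - homozygosity)
```

For a mitochondrial marker each individual contributes one haplotype label; for a diploid
microsatellite locus each individual contributes its two alleles, and the per-locus values
are usually averaged across loci.
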
Building on the phylogeographic pattern identified in the first part of the thesis, in which
two genetic lineages with a geographically distinguishable regionalisation were obtained,
models for predicting species distributions via ecological niche modelling (ENM) were used
in order to: (i) explore the potential distribution of the genetic lineages of B. c.
imperator, (ii) evaluate whether there are significant differences between the distributions
predicted for these lineages in terms of their ecological and environmental space, and (iii)
analyse whether some environmental differentiation can be detected in association with the
information provided by different types of genetic marker (mitochondrial vs.
microsatellites). This was done through ecological niche modelling and the use of different
statistics (e.g. PCA, Schoener's D, Hellinger's I), and the results are presented in chapter
3, “The environment's role on the genetic divergence between two Boa constrictor lineages
from Mexico and Central America”.
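
The two niche-overlap statistics named above can be sketched as follows. The code assumes
that each lineage's niche model has been exported as a list of non-negative suitability
scores over the same grid cells, which is an assumption made for illustration; the function
names are hypothetical. Both indices are computed on scores rescaled to sum to one and range
from 0 (no overlap) to 1 (identical models): D = 1 - 0.5 * sum|p_x - p_y| and
I = 1 - 0.5 * sum(sqrt(p_x) - sqrt(p_y))^2.

```python
import math


def normalise(suitability):
    """Rescale suitability scores so that they sum to one across cells."""
    total = sum(suitability)
    return [s / total for s in suitability]


def schoener_d(model_x, model_y):
    """Schoener's D: one minus half the summed absolute differences."""
    px, py = normalise(model_x), normalise(model_y)
    return 1.0 - 0.5 * sum(abs(a - b) for a, b in zip(px, py))


def hellinger_i(model_x, model_y):
    """I statistic derived from the Hellinger distance between the two
    normalised suitability surfaces."""
    px, py = normalise(model_x), normalise(model_y)
    squared = sum((math.sqrt(a) - math.sqrt(b)) ** 2 for a, b in zip(px, py))
    return 1.0 - 0.5 * squared
```

Tests of niche equivalency and similarity compare the observed D and I against values
obtained from randomised occurrence data; that randomisation step is not shown here.
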
Finally, because the information provided by the different types of molecular marker used in
this study, especially the microsatellites, proved invaluable for identifying processes of
secondary contact between genetic lineages, the first chapter, entitled “Microsatellites:
advantages and drawbacks in phylogeographic analyses”, presents a literature review of the
main analyses that can be performed with this type of molecular marker, as well as the scope
and limitations to consider when microsatellites are used in a phylogeographic context.

Chapter 1

Microsatellites: advantages and drawbacks in phylogeographic analyses
Marco Suárez-Atilano 1,2
1 Laboratorio de Genética y Ecología, Instituto de Ecología, Universidad Nacional Autónoma
de México. Circuito Exterior s/n, Anexo Jardín Botánico, Apartado Postal 70-275, Ciudad
Universitaria, UNAM, 04510 México, D.F. Tel. 01 (55) 56 22 90 28.
2 Posgrado en Ciencias Biológicas, Universidad Nacional Autónoma de México, Coyoacán,
México, D.F. 04510, México
E-mail: suarati@gmail.com

Abstract
In the last decades, microsatellites have become a widespread molecular marker, evidencing
their utility in several disciplines like population and landscape genetics. Characteristics like high
levels of polymorphism and codominant inheritance, combined with the development of new
statistical and computational approaches, have played an important role for the advance and
implementation of these markers in genetic and evolutionary studies. More recently,
phylogeography has adopted microsatellites as a common tool, particularly for the detection of
genetic structure on ample spatial ranges (regional or continental), measures of gene flow,
contrasting patterns of sex-biased dispersal and exploring evolutionary relationships among
populations. Nevertheless, the use of microsatellite loci in deeper temporal analyses has
important limitations and drawbacks. High levels of homoplasy, complex mutation processes,
their tandem molecular structure and sampling limitations in terms of number of loci and
individuals, are some of the main challenges when using microsatellites. However, both nuclear
microsatellites and organelle sequence-based DNA markers can be combined in a
phylogeographical framework to achieve more adequate and robust analyses, in order to explain
the historical (evolutionary) and current (ecological) processes associated with the geographic
distribution of genetic lineages.

Keywords: allelic frequencies, genetic structure, gene flow, hybridization, microsatellites,
phylogeography

Introduction
The distribution of genetic variation within species is the result of a combination of molecular and
population-level processes, including mutation, genetic drift, gene flow, selection and
demography (Wang 2010). Different kinds of molecular markers with different rates of
substitution and modes of inheritance may capture signatures of population processes occurring
at different times in the evolutionary scale (Avise et al. 1987, Thomson et al. 2010). However,
mistakes can result from using markers that inadequately capture information from the
evolutionary period of interest (Chenuil 2006, Thomson et al. 2010).
During the last decade, genetic approaches have become much more efficient, powerful and
accessible for answering ecological and evolutionary questions in fields like conservation
biology, molecular ecology and population genetics. Within this methodological revolution,
nuclear microsatellites have played an important role as a result of their widespread use in
population studies and of methodological and bioinformatic advances. Microsatellites have
become a popular marker due to their high levels of variability, scoring ease and reliability,
codominant inheritance and short lengths, which makes them useful for studies with samples as
difficult as fossils and hair or feces collected in the field (Luikart & England 1999, Selkoe &
Toonen 2006). Currently, microsatellites are the most widely used molecular marker in
population genetics (Garrick et al. 2010), and our understanding of their mutational behavior,
function, evolution and distribution in the genome across several taxa is rapidly increasing
(Goldstein & Schlötterer 1999, Li et al. 2002, Selkoe & Toonen 2006). The high level of
polymorphism of microsatellites, combined with advances in molecular techniques and in
mathematical and computational approaches, has allowed them to be used to answer diverse
biological questions, from conservation or landscape genetics at the ecological level to deeper
historical evolutionary analyses like phylogeography.
Phylogeography is broadly defined as the study of the ecological and evolutionary
processes responsible for the historical and contemporary geographic distribution of genetic
lineages, within and among closely related species (Avise 1987, 2000). Initially, this discipline
predominantly used organelle loci (mitochondrial/chloroplast DNA sequences) and allozymes, but
more recently microsatellites have been incorporated into phylogeographic approaches. Pioneering
studies used these markers to explore the domestication process in cattle (Bradley et al. 1994,
MacHugh et al. 1997, Hanotte et al. 2000) and the evolutionary history of human populations
(Takezaki & Nei 1996, Pérez-Lezaun et al. 1997). Since then, microsatellites have proved to be a
powerful complement to the more commonly used nuclear and mitochondrial sequences, and have
largely replaced allozymes (Zink 2010).
In this review, I describe the characteristics of microsatellites and then explore and discuss
the main analytical methods normally used with microsatellites in a phylogeographic framework.
In addition, I highlight their advantages and limits, based on particular aspects of
phylogeographic analyses, namely detection of genetic structure, gene flow (including sex-
biased dispersal), identification of hybrid zones and phylogenetic inference within and among
species.

Microsatellites: a review
Microsatellites are tandem repeats of 1–6 nucleotides found at high frequency in the nuclear
genomes (and sometimes in organelle genomes) of most taxa. They are codominant markers
with a Mendelian inheritance and highly polymorphic in natural populations (Luikart & England
1999), which are mostly considered neutral molecular markers (Jarne & Lagoda 1996, Selkoe &
Toonen 2006). The flanking regions (the DNA surrounding a microsatellite locus) are
generally highly conserved across individuals of the same or of closely related species, whereas
the microsatellite repeat itself (the tandem repeat) usually has a high mutation rate (5 × 10⁻⁴;
Goldstein & Schlötterer 1999, Whittaker et al. 2003) and may vary from one individual to another.
The most commonly proposed mutation processes for microsatellites are slippage and proofreading
errors during DNA replication, which primarily change the number of repeat units and hence the
length of the repeat tract. Such high mutation rates make microsatellites an excellent tool for
the detection of recent and ongoing microevolutionary processes. In contrast, markers with lower
mutation rates, such as mitochondrial DNA (mtDNA) with rates of approximately 6 × 10⁻⁸
(Haag-Liautard et al. 2008), are ideal for capturing molecular signatures of more ancient
processes. A useful analogy comes from a well-known issue in phylogenetic inference, in which
increasing the genetic variability of a molecular marker provides greater power to detect recent
branching events, but reduced power to assess deeper phylogenetic history because high rates
of substitution can overwrite older signatures (Wang 2010). In the latter case nuclear DNA
markers, with slower mutation rates, can resolve deeper (ancient/historical) phylogenies
(Zink & Barrowclough 2008).
Because the original goal of phylogeography is to bridge population genetics with
phylogenetics, this discipline also considers hybrid zone dynamics and speciation patterns
(Avise et al. 1987, Hewitt 2001). Under this scenario, it is important to use contrasting markers
that may detect different processes at various temporal and spatial scales. Analyses based on
microsatellite loci, now common in phylogeographic studies, rely on multilocus genotype and
allele frequencies (Garrick et al. 2010). While genotype frequencies may change in a few
generations, especially through sexual recombination, population allele frequencies are usually
less labile in the short term (Sunnucks 2000, Chenuil 2006, Garrick et al. 2010). Microsatellites
can thus complement mtDNA surveys (in the case of vertebrates), which yield a single, maternally
inherited haplotype that is taken to represent the entire organismal history. However, there are
limits and drawbacks when using microsatellite data, mainly related to the amount of homoplasy
(alleles that are identical by state, but not necessarily identical by descent), the relatively
long coalescence times, and their limited treatment by tree-based analyses (Brito & Edwards 2008,
Zink 2010).

The evaluation of genetic and phylogeographic structure
Because ecological and evolutionary processes necessarily occur in a geographic context,
biodiversity distribution patterns are usually structured in space (Lomolino & Heaney 2004,
Diniz-Filho et al. 2008). Nevertheless, the genetic structure within or among populations, and
at higher taxonomic levels (races, subspecies or species), should be evaluated considering not
only spatial but also temporal scales. Microsatellites are typically used in population genetics
to estimate genetic diversity, structure and distribution (Jarne & Lagoda 1996, Balloux &
Lugon-Moulin 2002, Selkoe & Toonen 2006, Diniz-Filho et al. 2008).
Phylogeography, on the other hand, expanded such statistics and methods to the regional or
even continental levels, especially in cases where the genetic structure could be explained in
terms of the presence of biological, physical or historical barriers, within the context of
geographic distributions (phylogeographic structure). Based on the analyses used in population
and landscape genetics, phylogeographic analyses with microsatellites often follow one of two
main approaches. The first, more traditional one is the a priori delimitation of populations, in
which a number of specific sampling sites with a known number of individuals are used and every
locality is treated as a population (Pritchard et al. 2000, Evanno & Goudet 2005, Guillot et al.
2009). However, this approach is not appropriate when the study includes widespread species with
a continuous distribution and/or sampling localities with dissimilar sample sizes (e.g.
localities represented by museum samples) (Balloux & Lugon-Moulin 2002). In these cases, ideally,
one should use methods that do not require population boundaries to be assumed beforehand, which
implies that the individual is the operational unit of study (Guillot et al. 2005a). The result
of this second approach is the a posteriori delimitation of genetic clusters of individuals
(individual clustering analysis) (Pritchard et al. 2000).
Several statistical approaches can be used to produce these clusters (Table 1). The most
widely used methods are based on Bayesian computation analysis to maximize Hardy-Weinberg
and linkage equilibrium across clusters (Pritchard et al. 2000, Evanno & Goudet 2005), while
other methods search for areas of strong changes in allele frequencies, also called genetic
discontinuities, using explicitly the individual or population spatial location, such as the
Monmonier algorithm (Monmonier 1973, Patten & Smith-Patten 2008, Garrido-Garduño &
Vázquez-Domínguez 2013). Finally, methods based on principal component analysis (PCA) can
determine genetic isolines similar to contour lines on topographical maps (Smouse & Peakall
1999, Wesser et al. 2004, 2006). Because clustering analyses with different statistical methods
can differ in the number of clusters or in the genetic discontinuities they detect, a
recommended approach is to compare results and identify the main factors responsible for the
genetic structuring (Latch et al. 2006; François & Durand 2010), for instance by evaluating the
correspondence between the spatial distribution of the genetic clusters and the presence of
potential biotic or abiotic barriers (Guillot et al. 2009). In addition, individual- or
population-based clusters might not reflect “real populations” in terms of biologically defined
units (Rossiter et al. 2007, Fontaine et al. 2007, 2010, Baedell et al. 2010), given the wide
geographic scale commonly used in phylogeographic studies.

Exploring gene flow
The amount, pattern and directionality of gene flow are directly associated with the degree of
genetic differentiation and structure of populations. Migration rates are notoriously difficult to
measure by direct tracking of individuals or gametes in natural populations (Frankham et al.
2002). Furthermore, immigrants may not survive or breed (i.e. contribute to the gene pool) in
their new habitat; consequently, gene flow, rather than migration rate, needs to be estimated
(Frankham et al. 2002, Allendorf & Luikart 2007).
Phylogeography takes into account different gene flow measures based on genetic
population analyses. The simplest approaches are based on inbreeding or fixation indices (FST
or RST). These “F-statistic methods” provide an estimation of the number of migrants per
generation (Nm). Nevertheless, this parameter is derived from the island model of migration,
where population size and migration are assumed to be constant in time (Wright 1969), and thus
does not consider that the number of migrants per generation might reflect historical instead of
current gene flow, and it gives no estimate of directionality (Frankham et al. 2002). More recent
approaches based on statistical methods (maximum likelihood or Bayesian computation) use
assignment tests to detect migrants (Rannala & Hartigan 1996, Cornuet et al. 1999, Wilson &
Rannala 2003). These methods employ multilocus genotypes to identify migrant individuals, or
those with recent migrant ancestry, in a set of sampled populations, with the power to detect
immigrant ancestors from up to two generations back (Rannala & Mountain 1997).
Disadvantages of these methods include their sensitivity to the presence of null alleles (Rannala
& Hartigan 1996, Wilson & Rannala 2003) and their poor performance both when sample sizes
across populations or localities are small and dissimilar, and when the differentiation of
allele frequencies among populations is high (Rannala & Mountain 1999, Wilson & Rannala
2003).
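As a minimal illustration of the F-statistic approach described above, the sketch below converts a fixation index into the number of migrants per generation using the standard island-model approximation FST ≈ 1 / (4Nm + 1). The function name and the example FST values are hypothetical, chosen only for illustration; the caveats listed in the text (equilibrium, constant size and migration, no directionality) apply to every number it returns.

```python
def migrants_per_generation(fst: float) -> float:
    """Island-model approximation: FST ~ 1 / (4Nm + 1), solved for Nm.

    Valid only under the island-model assumptions (equilibrium, constant
    population size and symmetric migration); hypothetical helper name.
    """
    if not 0.0 < fst <= 1.0:
        raise ValueError("FST must lie in the interval (0, 1]")
    return (1.0 / fst - 1.0) / 4.0


# Illustrative (made-up) FST values, from weak to strong differentiation.
for fst in (0.01, 0.05, 0.2, 0.5):
    print(f"FST = {fst:.2f} -> Nm ~ {migrants_per_generation(fst):.2f}")
```

Because Nm falls steeply as FST rises, small errors in low FST estimates translate into large changes in the inferred number of migrants, which is one reason these values are best read as rough, historical averages.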
Modern estimators based on coalescent, genealogy-based modeling allow the magnitude, direction
and timing of gene flow to be estimated. Demographic parameters such as population size (via
Theta) and migration rates can be estimated simultaneously with maximum-likelihood and Bayesian
coalescent models in the software MIGRATE-N (Beerli 2004, 2006, Beerli & Palczewski 2010). This
method also allows migration scenarios (e.g. source-sink populations or fully asymmetrical
migration) to be tested via log-likelihood scores. Another option, the Isolation with Migration
model of population divergence (IM; Nielsen & Wakeley 2001), relies on Bayesian MCMC computation
(IMa2; Hey & Nielsen 2004, 2007). This model is able to distinguish between complete isolation
and divergence with gene flow, considering three possibilities for each pairwise population
comparison: high gene flow in the present or near present, gene flow after divergence in the
past, and divergence with complete isolation (Hey & Nielsen 2004, 2007, Hey 2005). Recently, the
IM model was extended to comparisons among multiple populations, even if they have modest levels
of gene flow or involve multiple populations of closely related species (Hey 2010b). The IM
method is useful for analyzing both sympatric speciation, where gene exchange occurs from the
onset of divergence, and secondary contact following a period of geographic isolation, including
intermittent gene flow (Pinho & Hey 2010).

Sex-biased dispersal
A special case of within-population “gene flow” should be considered for organisms with
separate sexes and (often) asymmetrical dispersal between them; such sex-biased dispersal
may have different effects on the evolution of ecological and genetic characteristics (Goudet et
al. 2002). In this context, the simultaneous use of markers with different modes of inheritance
(for example, autosomal, sex-linked and uniparentally inherited haploid markers) can reveal
contrasting patterns in the distribution of a species' genetic variation (Caparroz et al. 2009).
Nevertheless, comparisons between different kinds of markers can be difficult, because haploid
haplotype data are usually based on DNA sequences, whereas diploid microsatellite data are
summarized as allele frequencies. It is also necessary to consider other characteristics, like
the fourfold (twofold in the case of monoecious species) difference in effective population size
between organelle and autosomal markers (Birky et al. 1989). For example, mtDNA may inherently
show higher levels
of differentiation at equilibrium than microsatellites, even though the microsatellite mutation rate
is roughly an order of magnitude higher (Caparroz et al. 2009). Furthermore, interpretation of the
relative influence of historical and contemporary factors solely on the basis of genetic data can
be complicated, given that reproductive or breeding behavior observations and knowledge of life
history traits are relevant to the interpretation of the genetic structure patterns (Scribner et al.
2001, Frankham et al. 2004).
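The effect of the fourfold difference in effective population size can be illustrated with a small sketch. It assumes, purely for illustration, an island model at equilibrium, a dioecious species with an equal sex ratio and equal migration rates in both sexes, so that the expected FST is 1 / (1 + 4Nm) for autosomal markers and 1 / (1 + Nm) for mtDNA. The function name and the parameter values are hypothetical.

```python
def equilibrium_fst(ne: float, m: float, marker: str = "nuclear") -> float:
    """Expected island-model FST at equilibrium for a given marker type.

    Simplifying assumptions (not taken from the text): equal sex ratio and
    equal migration in both sexes, so the mtDNA effective size is Ne / 4.
    """
    scale = {"nuclear": 4.0, "mtdna": 1.0}[marker]
    return 1.0 / (1.0 + scale * ne * m)


ne, m = 1000, 0.001  # hypothetical effective size and migration rate
print("autosomal FST:", equilibrium_fst(ne, m, "nuclear"))
print("mtDNA FST:    ", equilibrium_fst(ne, m, "mtdna"))
```

Under these assumptions the organelle marker shows markedly stronger differentiation for the same demographic history, so a difference between marker types is not by itself evidence of sex-biased dispersal.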
On the other hand, the genetic structure of each sex can be analyzed separately via
sex-specific hierarchical AMOVAs (Excoffier et al. 1992) or assignment tests (Favre et al. 1997,
Smouse & Peakall 1999), which are useful for detecting philopatry. Both approaches are based on
the following rationale: i) one sex does not transmit haplotypes to subsequent generations
(e.g. males do not transmit mtDNA haplotypes), and thus the haplotype of a male immigrant is only
transiently present in the population (i.e. while he is alive); ii) reproductive female
immigrants transmit their haplotypes to subsequent generations, thereby having a greater
potential than males to homogenize mtDNA genetic structure by dispersal. In this case, stronger
mtDNA genetic structure for females than for males provides compelling evidence for male-biased
dispersal (O’Corry-Crowe et al. 1997, Lukoschek et al. 2008).

The special case of hybridization processes
Hybridization between individuals from genetically distinct populations or lineages is of interest
across many fields in biology. In addition, the detection of hybrid zones, regions where
individuals from genetically distinct populations interbreed and produce genetically mixed
offspring, has been recognized as a challenge in evolutionary studies, particularly those
concerning models of speciation, selection, maintenance of species boundaries and conservation;
it becomes more challenging in cases where hybridization between endemic and introduced
species occurs (Anderson & Thompson 2002). The use of molecular markers to detect natural
and human-mediated cross breeding of populations or species has been widely explored in the
context of phylogeography (Avise 2004).
Sequence-based molecular markers have typically been used to detect hybrid zones, but the
simultaneous use of different markers can be more helpful for differentiating hybridization
processes, such as sex-asymmetric hybridization or introgression. Different methods common in
population genetics can be used for hybridization analyses. Methods based on allele frequencies
can be implemented to detect hybrids based on the ancestral allele frequencies of the
known or putative original groups. Every allele frequency at each locus is considered in the
analysis, and several statistical approaches, like regression-based least squares, maximum
likelihood or Bayesian analysis, are used to derive the best estimate of the admixture proportion,
in a way mathematically similar to clustering or assignment analyses (Chakraborty 1985, Anderson
& Thompson 2002). Other methods allow the detection of hybrid lineages based on diagnostic
loci, which exhibit private alleles or discrete allelic ranges exclusive to certain well-defined
lineages, like subspecies or closely related species. This approach has been used in only a few
cases, such as the detection of introgression from the domestic dog into the Ethiopian wolf
(Canis simensis), the most threatened canid species (Gottelli et al. 1994, Vilá et al. 1999).
Another example is the domestication history of African humpless cattle (Bos taurus), in which
introgression from Indian zebu cattle (Bos indicus) was detected (MacHugh et al. 1997,
Hanotte et al. 2000). The principal limitation of methods based on private alleles is that they
require a certain number of diagnostic loci and, if possible, a good sampling of the parental
species to prevent misclassification of individuals (Goldstein & Schlötterer 1999, Anderson &
Thompson 2002).
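The regression-based least-squares idea mentioned above can be sketched in a few lines. The example assumes, as a simplification, that the allele frequencies of the admixed group are a linear mixture of those of two parental groups, p_h = m·p_1 + (1 − m)·p_2, and estimates m by ordinary least squares over all alleles. The function name and the allele frequencies are hypothetical and are not taken from any of the cited studies.

```python
def admixture_proportion(p_parent1, p_parent2, p_hybrid):
    """Least-squares estimate of the contribution m of parent 1.

    Model (an assumption of this sketch): p_hybrid = m * p1 + (1 - m) * p2,
    fitted across all alleles; the estimate is clipped to [0, 1].
    """
    num = sum((h - b) * (a - b) for a, b, h in zip(p_parent1, p_parent2, p_hybrid))
    den = sum((a - b) ** 2 for a, b in zip(p_parent1, p_parent2))
    if den == 0:
        raise ValueError("parental groups have identical allele frequencies")
    return min(1.0, max(0.0, num / den))


# Hypothetical frequencies of three alleles in two parental groups and a hybrid group.
p1 = [0.8, 0.1, 0.6]
p2 = [0.2, 0.7, 0.4]
hybrids = [0.35, 0.55, 0.45]
print(f"estimated contribution of parent 1: {admixture_proportion(p1, p2, hybrids):.2f}")
```

The estimate is only as good as the parental frequencies: when the parental groups are poorly sampled or weakly differentiated, the denominator is small and the estimate becomes unstable, which mirrors the sampling requirement noted in the text.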
Studies of hybridization using microsatellites exclusively are limited, probably because of
their high mutation rates, which restrict the detection of ancient hybridization. On the other
hand, when hybridization is recent or when a good sample of the original lineages is available,
microsatellites are very effective. Indeed, microsatellite-based analyses are used to contrast
signals detected with other markers like mtDNA or nuclear introns (Sequeira et al. 2008,
Rodríguez et al. 2010). When the historical context is well known, patterns of secondary contact
and recent gene flow in natural populations can be detected. An example of the temporal
limitations of microsatellites for distinguishing ancient hybridization is the Iberian salamander
Chioglossa lusitanica: while mtDNA and allozymes support the presence of a hybrid zone
between the Douro and Mondego rivers (Portugal), Bayesian multilocus analysis inferred the
presence of three genetic clusters, with the individuals of mixed population origin found in a
restricted geographic area being detected as a third, independent cluster (Alexandrino et al.
1997, 2000, 2002, Sequeira et al. 2008).

Phylogenetic inferences with microsatellites
In previous sections, the importance of considering the rates of variability of the loci used in
phylogenetic inference analyses is emphasized. In this context, microsatellites, with mutation
rates several times higher than other noncoding nuclear and organelle DNA regions, could be
inappropriate for detecting ancient divergence signatures both within and among species. Another
limitation could be their unequal levels of polymorphism among neighboring species (which can be
highly variable), or even a lack of variability when microsatellite loci are evaluated across
related taxa, as well as mutation processes in the flanking regions of closely related species
(Felsenstein 2004, Barbará et al. 2007). Furthermore, high levels of homoplasy in
microsatellites make inference especially difficult in this kind of study, in which the analysis
of population structure in a geographic context is of prime importance (Balloux & Lugon-Moulin
2002, Avise 2004).
DNA sequences represent aligned homologous molecular characters, scored as haplotypes.
To represent evolutionary relationships among these haplotypes (or individuals), several
methods for the construction of phylogenetic trees can be implemented: genetic distances based
methods, maximum-parsimony, maximum-likelihood or Bayesian inference (Garrick et al. 2010).
Alternatives to phylogenetic trees are network approaches, such as statistical parsimony or
median-joining algorithms, which can be more appropriate for the typically shallow population-
level sequence divergences, because they allow for reticulation and the presence of extant
ancestral sequences (Posada & Crandall 2001, Knowles 2009). Furthermore, the patterns and
polarity of a network can be informative about historical events like recent population expansions
or for distinguishing between contemporary versus historical gene flow (Templeton 2004). In
contrast, phylogenetic inferences based on microsatellites are more restricted, although
comparisons among individuals and populations are possible, specifically phylogenetic trees
based on genetic distances. Also, “kinship” distances may be implemented to describe
phylogenetic relationships between individuals (Table 2). A first approach to using genetic
distances was made by Cavalli-Sforza and Edwards (1967), derived as a relationship between
the chord genetic distance and the kinship coefficient (Dkf; Cavalli-Sforza & Bodmer 1971).
Later on, Chakraborty and Jin (1993) developed a distance based on the average number of
alleles shared between individuals (DAS). An advantage of such inter-individual genetic distances
is that they eliminate the need to define populations a priori, which can be helpful in studies
of recent population movements, facilitating the detection of individual admixture and the
estimation of potential recent migration and geographic affiliation (Ruíz-Linares et al. 1993,
Goldstein & Schlötterer 1999). A limitation of this distance measure is the high number of
hypervariable loci (>15 loci, He>75%) needed to achieve a reliable approximation
(Chakraborty & Jin 1993, Luo et al. 2004).
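A shared-allele distance between two individuals can be sketched as follows. This is one common convention (one minus the mean proportion of alleles shared per locus in diploids), written here only for illustration; the function name, the genotype layout and the allele sizes are hypothetical, and published implementations may differ in how they treat homozygotes or missing data.

```python
from collections import Counter


def shared_allele_distance(ind_a, ind_b):
    """1 - proportion of alleles shared, averaged over loci (diploid data).

    Each individual is a list of (allele1, allele2) tuples, one per locus,
    with loci in the same order in both individuals.
    """
    if len(ind_a) != len(ind_b) or not ind_a:
        raise ValueError("individuals must be typed at the same non-empty set of loci")
    shared = 0
    for geno_a, geno_b in zip(ind_a, ind_b):
        # Multiset intersection: (150, 150) vs (150, 152) shares one allele.
        shared += sum((Counter(geno_a) & Counter(geno_b)).values())
    return 1.0 - shared / (2 * len(ind_a))


# Hypothetical three-locus genotypes (allele sizes in base pairs).
ind1 = [(150, 152), (200, 200), (98, 104)]
ind2 = [(150, 154), (200, 202), (100, 106)]
print(f"DAS = {shared_allele_distance(ind1, ind2):.3f}")
```

With only three loci the distance can take just seven distinct values, which gives a sense of why a large number of highly variable loci is needed before such inter-individual distances become informative.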
When considering among-population comparisons, the two most widely used approaches differ in
their assumptions about the mutational process (Table 2): a) a genetic distance based on the
Infinite Allele Model of mutation (IAM; Kimura & Crow 1964), in which each mutation creates a
novel allele at a given rate (u). Because this model does not allow for homoplasy, identical
alleles share the same ancestry and are necessarily identical-by-descent (IBD). Accordingly, it
only considers allele frequencies in the populations being compared (Takezaki & Nei 1996) and
discards allele size information. Furthermore, it makes no assumptions about constant population
sizes or mutation rates among loci; thus, when used to build phylogenetic trees, the branch
lengths are not proportional to evolutionary rates. Frequently, different IAM distances achieve
a similar topology, that is, they have a higher probability of depicting the correct tree
topology (Pérez-Lezaun et al. 1997, Destro-Bisol et al. 2000, Suk & Neff 2009, Takezaki & Nei
1996, 2008).
b) A distance that takes into account the typical mutational process described for
microsatellites, the Stepwise Mutation Model (SMM; Kimura & Ohta 1978), in order to estimate
the time of divergence between populations. Under this mutation scenario, novel alleles are
created by adding or deleting a single repeat unit, with an equal probability u/2 in both
directions. Consequently, alleles of very different sizes will be more distantly related than
alleles of similar sizes (i.e. the SMM has a memory of allele size). The algorithm for this SMM
genetic distance includes deviations of allele size or allelic range; it also generates branch
lengths proportional to evolutionary change and allows for a direct estimation of divergence
times among populations when mutation rates are known (Goldstein et al. 1995). Some limitations
are that different SMM distances do not recover similar topologies as consistently as IAM
distances do, and that they are more sensitive to large differences in population size (Reynolds
et al. 1983, Goldstein et al. 1995, Ruzzante 1998, Calafell et al. 2000, Scrimbed et al. 2001,
Takezaki & Nei 1996, 2008).
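To make the "memory of allele size" concrete, the sketch below computes an SMM-type distance in the spirit of (δμ)²: the squared difference between the mean allele sizes of two populations, averaged over loci. It is a simplified illustration; the function name and the allele sizes (expressed in repeat units) are hypothetical, and no sample-size correction is applied.

```python
from statistics import mean


def delta_mu_squared(pop_a, pop_b):
    """Average over loci of the squared difference in mean allele size.

    Each population is a list of loci; each locus is a list of sampled
    allele sizes in repeat units. Simplified sketch, no bias correction.
    """
    if len(pop_a) != len(pop_b) or not pop_a:
        raise ValueError("populations must be typed at the same non-empty set of loci")
    return mean((mean(a) - mean(b)) ** 2 for a, b in zip(pop_a, pop_b))


# Hypothetical allele sizes (repeat units) at two loci in two populations.
pop_a = [[10, 11, 12, 11], [20, 20, 21, 19]]
pop_b = [[13, 14, 13, 12], [20, 21, 22, 21]]
print("(delta mu)^2 =", delta_mu_squared(pop_a, pop_b))
```

An IAM-type distance computed on the same data would treat alleles of 11 and 12 repeats as no more similar than alleles of 11 and 20 repeats, whereas this measure uses the size difference itself, which is also why it is sensitive to departures from strictly stepwise mutation.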
Topological incongruences can arise among trees based on different genetic distances, for which
different explanations have been proposed, mainly in terms of the mutation process assumed by
each distance. Because the effectiveness of the different estimators in providing an
accurate resolution of population structure depends on the histories of the organisms under
study (Balloux & Lugon-Moulin 2002; Hardy et al. 2003), IAM-based distance methods implicitly
assume that population differentiation is recent and mostly governed by migration and genetic
drift. That is, the populations are still connected by present-day gene flow, and the time
elapsed since their divergence is too short for mutation and isolation to have contributed
appreciably to population differentiation (Destro-Bisol et al. 2000; Balloux & Lugon-Moulin 2002,
Segarra-Mongarres & Catalana 2008). Another explanation for topological incongruences among
distances concerns the number of markers used. It has been shown that increasing the number
of loci reduces the sampling variance more than increasing the sample size does
(Shriver et al. 1995; Takezaki & Nei 1996, 2008). For example, Takezaki and Nei
used 783 highly informative dinucleotide microsatellite loci to analyze the phylogenetic
relationships between 12 representative human populations, and recovered the same topology
with high levels of statistical (bootstrap) support, independently of the mutation model used
(Takezaki & Nei 2008). Genetic distances play an important role in estimating divergence times
and processes among populations; however, they can be highly affected by demographic
processes, like migration and population dynamics, in combination with sampling schemes,
factors that should be considered carefully (Zhivotovsky 2001).
One last issue regarding phylogenetic inferences with microsatellites refers to the grouping
methods implemented to build distance trees from distance matrices. Two widely used
algorithms for microsatellite markers are UPGMA (Unweighted Pair-Group Method with
Arithmetic Mean; Sokal & Michener 1958) and NJ (Neighbor-Joining, Saitou & Nei 1987).
Studies have shown that UPGMA describes population structure well when populations
have had a bifurcating history of fragmentation, but severely distorts genetic relationships
under the linear and two-dimensional stepping-stone models. The NJ algorithm is adequate for a
broader range of evolutionary histories, including the linear stepping-stone model, which
generates different branching patterns among populations in response to different gene
flow and diversification patterns (Calafell et al. 2000, Destro-Bisol 2000, Kalinowski 2009).
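For readers unfamiliar with how a distance matrix becomes a tree, the following is a minimal UPGMA sketch in plain Python. It returns only the topology as nested tuples (branch heights are omitted for brevity), and the population names and distances are hypothetical. It is meant to show the averaging step that gives the method its name, not to replace dedicated phylogenetic software.

```python
def upgma(names, dist):
    """Build a UPGMA topology (nested tuples) from a symmetric distance matrix."""
    clusters = {i: (name, 1) for i, name in enumerate(names)}  # id -> (tree, size)
    d = {frozenset((i, j)): dist[i][j] for i in range(len(names)) for j in range(i)}
    next_id = len(names)
    while len(clusters) > 1:
        pair = min(d, key=d.get)  # closest pair of clusters
        i, j = sorted(pair)
        (tree_i, n_i), (tree_j, n_j) = clusters.pop(i), clusters.pop(j)
        del d[pair]
        for k in clusters:
            d_ik = d.pop(frozenset((i, k)))
            d_jk = d.pop(frozenset((j, k)))
            # Size-weighted mean = unweighted average over all original members.
            d[frozenset((next_id, k))] = (n_i * d_ik + n_j * d_jk) / (n_i + n_j)
        clusters[next_id] = ((tree_i, tree_j), n_i + n_j)
        next_id += 1
    return next(iter(clusters.values()))[0]


# Hypothetical pairwise genetic distances among four populations.
names = ["A", "B", "C", "D"]
dist = [
    [0, 2, 6, 10],
    [2, 0, 6, 10],
    [6, 6, 0, 10],
    [10, 10, 10, 0],
]
print(upgma(names, dist))
```

Because every merge assumes that all lineages have evolved at the same rate, the method forces an ultrametric, strictly bifurcating tree onto the data, which is consistent with its poor behavior under stepping-stone histories noted above.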

Are there more ‘adequate’ microsatellites for phylogeography than others?
As discussed in the previous sections, the quantity of information that can be obtained from
microsatellite data in a phylogeographic framework depends on several factors, like sampling
scheme or statistical methods. Another factor to be considered is that the information that can be
obtained from different kinds of microsatellites can vary. Based on their tandem arrays,
microsatellites may be classified into three main groups: pure (also called perfect), compound (a
mixture of different arrays) and interrupted (or imperfect), where any combination of these
groups is possible (Table 3). They can also be divided according to their number of nucleotides
per repeat, such as di-, tri- and tetranucleotides. Within each type, attention has also been
focused on a limited number of motifs; for example, dinucleotides with CA and GA repeats are
preferred in animal studies (Jarne & Lagoda 1996). Rosenberg and collaborators developed a
statistic called informativeness for assignment (In), defined as the amount of information that
multiallelic markers provide about individual ancestry (Rosenberg et al. 2003). They used 377
microsatellite markers (45 dinucleotides, 58 trinucleotides and 274 tetranucleotides) genotyped
in 1,056 individuals from 52 human populations, to analyze the degree of informativeness
achieved with each group of microsatellites. Their results showed that the most informative
were pure dinucleotides, followed by pure tri- and tetranucleotides (Table 3). This degree of
informativeness has been attributed to the levels of polymorphism (Kalinowski 2002), where
perfect dinucleotides show the most variation among markers (Rosenberg et al. 2002, 2003),
and to the mutation process. Whereas pure loci may fit a stricter SMM, imperfect
repeat microsatellites may conform to a two-phase model (an intermediate between the IAM and
the SMM; Di Rienzo et al. 1994, Garza & Williamson 2001), being potentially more powerful for
detecting other processes like genetic bottlenecks (Cristescu et al. 2010, Funk et al. 2010).
Perfect dinucleotides have also been used to estimate divergence times between population
pairs by using the formula (δμ)² = 2βτ, where β is the mutation rate and τ is the time since
divergence in generations (Goldstein et al. 1995a). Divergence times may be calculated from a
regression based on (δμ)². With this approach, Goldstein et al. (1995b) dated the divergence
between African and non-African human populations to 156,000 years ago, which is broadly
consistent with coalescence time estimates determined from mtDNA data (around 200,000 years ago;
Goldstein & Schlötterer 1999). However, it should be noted that these estimates are prone to
errors, particularly
associated with the use of calibration points, generation time values and the added bias of
mutation rate estimates. This last error might be large because mutation rates in microsatellites
differ highly among loci and among closely-related species (Di Rienzo et al. 1994, Goldstein &
Schlöterer 1999). For example, the estimated divergence time between two putative species of
chamois (Rupicapra rupicapra and Rupicapra pyrenaica) with ((δµ) 2 based on 25 di and
trinucleotide perfect microsatellites, and assuming the same average mutation rate that in
human microsatellites, was 57,000 years, which is five times lower than the date determined
from mitochondrial RFLP haplotypes and the fossil record (280,000 years; Pérez et al. 2002,
Rodríguez et al. 2010).
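
The dating approach described above can be sketched in a few lines of Python. This is a minimal illustration, assuming the relation (δµ)² = 2βτ with τ measured in generations since divergence (Goldstein et al. 1995a) and a single-step mutation process. The function names, allele sizes, mutation rate and generation time below are hypothetical values chosen only to show the arithmetic; they are not data from any of the cited studies.

```python
def delta_mu_squared(pop_a, pop_b):
    """Average over loci of the squared difference in mean allele size.

    pop_a, pop_b: lists with one entry per locus; each entry is a list of
    allele sizes (in repeat units) sampled from that population.
    """
    squared_diffs = []
    for sizes_a, sizes_b in zip(pop_a, pop_b):
        mean_a = sum(sizes_a) / len(sizes_a)
        mean_b = sum(sizes_b) / len(sizes_b)
        squared_diffs.append((mean_a - mean_b) ** 2)
    return sum(squared_diffs) / len(squared_diffs)

def divergence_time(dmu2, beta, years_per_generation):
    """Solve (delta mu)^2 = 2 * beta * tau for tau; return (generations, years)."""
    generations = dmu2 / (2.0 * beta)
    return generations, generations * years_per_generation

# Hypothetical allele sizes at two loci for two populations
pop_a = [[10, 10, 12, 12], [20, 22]]
pop_b = [[14, 14, 14, 14], [20, 20]]

dmu2 = delta_mu_squared(pop_a, pop_b)
# beta and generation time are assumed values, not estimates from the literature
generations, years = divergence_time(dmu2, beta=5e-4, years_per_generation=5)
print(dmu2, generations, years)
```

Because τ is inversely proportional to β, halving the assumed mutation rate doubles the estimated divergence time, which illustrates why borrowing a rate from another species can shift a date severalfold, as in the chamois example (280,000 / 57,000 ≈ 4.9).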

Conclusions

This brief review emphasized the analyses that allow microsatellites to provide robust results
within a phylogeographic framework. Several of these methods are not recent
in terms of their origin and statistical inference. For example, genetic distances
are being abandoned in sequence-based phylogenetic analyses in favor of other
methods such as maximum likelihood and Bayesian inference. However, phylogeographic studies
based on microsatellite data apply those metrics more and more often, and are being
complemented by analytical methods from population and landscape genetics, mainly for
detecting genetic structure.
In addition, we highlighted some drawbacks and limitations that should be considered when
planning analyses and interpreting results. Potential homoplasy, the specific mutation process
and the differing informativeness of different kinds of microsatellite loci are the main
limitations for inferring deeper and broader historical processes in evolutionary terms. Many of
these limitations are directly related to the molecular ‘behavior’ of this marker, which restricts
its utility to within-species evolutionary studies. The most important practical lesson is that the
more thorough and complete our genetic information, the more robust our knowledge of the
historical processes responsible for the geographic distribution of genetic diversity within and
among species will be, and the more reliable that information will be for
current and future genetic and biodiversity conservation (Frankham 2010, Allendorf et al. 2010),
including specific conservation issues such as detecting introgression from invasive species into
natural populations and its evolutionary consequences (Ficetola et al. 2008, Suárez
& Tsutsui 2008).
Phylogeography based on microsatellite data is growing as a legitimate and well-defined
approach based on the analysis of the geographic structure and distribution of allelic
frequencies, much as genealogical lineages are analyzed with sequence-based DNA markers.
Being clearly aware of the inferential limitations of microsatellites,
combining the evidence they provide with that from other kinds of markers, and considering
biological information has facilitated robust and integrative phylogeographic
inferences based on microsatellites.

Acknowledgements
Thanks to Ella Vázquez-Domínguez and Juan Pablo Jaramillo-Correa for their guidance and
revision of this manuscript. Special acknowledgments to Gabriela Parra-Olea and Daniel Piñero
for their invaluable comments on the topics reviewed in this work. This paper
is presented in partial fulfillment of the requirements of the Graduate Program in Biological
Sciences of the National Autonomous University of México (UNAM). M.S.A. acknowledges the
scholarship and financial support provided by the National Council of Science and Technology
(CONACyT 346511), the Scholarship Program for Postgraduate Studies (PAEP) and UNAM.

References

Alexandrino J., Ferrand N. and Arntzen J. W. 1997 Genetic variation in some populations of the
golden-striped salamander, Chioglossa lusitanica (Amphibia: Urodela), in Portugal.
Biochem. Genet. 35, 371–381.
Alexandrino J., Froufe E., Arntzen J. W. and Ferrand N. 2000 Genetic subdivision, glacial refugia
and postglacial recolonization in the golden-striped salamander, Chioglossa lusitanica
(Amphibia: Urodela). Mol. Ecol. 9, 771–781.
Alexandrino J., Arntzen J.W. and Ferrand N. 2002 Nested clade analysis and the genetic
evidence for population expansion in the phylogeography of the golden-striped salamander,
Chioglossa lusitanica (Amphibia: Urodela). Heredity 88, 66–74.
Allendorf F.W. and Luikart G. 2007 Conservation and Genetics of Populations. Blackwell
Publishing. UK.
Allendorf F. W., Hohenlohe P. A. and Luikart G. 2010 Genomics and the future of conservation
genetics. Nat. Rev. Genet. 11, 697-709.
Anderson E. C. and Thompson E. A. 2002 A Model-Based Method for Identifying Species Hybrids
Using Multilocus Genetic Data. Genetics 160, 1217-1229.
Avise J. C., Arnold J., Martin-Ball R., Bermingham E., Lamb T., Neigel J. E., Reeb C. A. and
Saunders N. C. 1987 Intraspecific Phylogeography: The Mitochondrial DNA Bridge Between
Population Genetics and Systematics. Ann. Rev. Ecol. Syst. 18, 489-522.
Avise J. C. 2000 Phylogeography: The history and formation of species. Harvard
University Press. USA.
Avise J. C. 2004 Molecular Markers, Natural History and Evolution. Sinauer Associates Inc.
Publishers. Sunderland, Massachusetts.
Calafell F., Pérez-Lezaun A. and Bertranpetit J. 2000 Genetic distances and microsatellite
diversification in humans. Hum. Genet. 106, 133-134.
Balloux F. and Lougon-Moulin N. 2002 The estimation of population differentiation with
microsatellite markers. Mol. Ecol. 11, 155-165
Beadell J. S., Hyseni C., Abila P.P., Azabo R., Enyaru J., Ouma J.O., Mohamen Y.O., Okedi
L.M., Aksoy S. and Caccone A. 2010 Phylogeography and Population Structure of Glossina
fuscipes fuscipes in Uganda: Implications for Control of Tsetse. PLoS Negl Trop Dis 4,
e636. doi:10.1371/journal.pntd.0000636
Beerli, P. 2004 Effect of unsampled populations on the estimation of population sizes and
migration rates between sampled populations. Mol. Ecol. 13, 827-836.
Beerli, P. 2006 Comparison of Bayesian and maximum likelihood inference of population genetic
parameters. Bioinformatics 22, 341-345.
Beerli, P. and Palczewski M. 2010 Unified framework to evaluate panmixia and migration
direction among multiple sampling locations. Genetics 185, 313-326.
Bradley D. G., MacHugh D. E., Loftus R. T. et al. 1994 Zebu-taurine variation in Y chromosomal
DNA: a sensitive assay for genetic introgression in West African trypanotolerant cattle
populations. Anim. Genet. 25, 7-12.
Brito P. H. and Edwards S. V. 2009 Multilocus phylogeography and phylogenetics using
sequence-based markers. Genetica 135, 439-455.
Birky C. W. Jr., Fuerst P. and Maruyama T. 1989 Organelle gene diversity under migration,
mutation, and drift: equilibrium expectations, approach to equilibrium, effects of
heteroplasmic cells, and comparison to nuclear genes. Genetics 121, 613-627.
Caparroz R., Miyaki C. Y. and Baker A. J. 2009 Contrasting Phylogeographic Patterns in
Mitochondrial DNA and Microsatellites: Evidence of Female Philopatry and Male-Biased
Gene Flow Among Regional Populations of the Blue-and-Yellow Macaw (Psittaciformes: Ara
ararauna) in Brazil. The Auk 126, 359-370.
Cavalli-Sforza L. L. and Edwards A. W. F. 1967 Phylogenetic Analysis: Models and Estimation
Procedures. Am. J. Hum. Genet. 19, 233-257.
Cavalli-Sforza L. L. and Bodmer W. F. 1971 The genetics of human populations. W. H.
Freeman, San Francisco.
Chakraborty R. 1985 Paternity testing with genetic markers: are Y-linked genes more efficient
than autosomal ones? Am. J. Med. Genet. 21, 297-305.
Chakraborty R. and Jin L. 1993 A unified approach to study hypervariable polymorphisms:
statistical considerations of determining relatedness and population distances. In DNA
Fingerprinting: Current State of the Science (eds. S. D. J. Pena, A. J. Jeffreys, J. Epplen, R.
Chakraborty) pp. 153-175. Birkhauser, Basel.
Chen C., Durand E., Forbes F. and François O. 2007 Bayesian clustering algorithms
ascertaining spatial population structure: a new computer program and a comparison study.
Mol. Ecol. Notes 7, 747-756.
Chenuil A. 2006 Choosing the right molecular genetic markers for studying biodiversity: from
molecular evolution to practical aspects. Genetica 127, 101-120.
Corander J., Waldmann P., Sillanpää M. J. 2003 Bayesian analysis of genetic differentiation
between populations. Genetics 163, 367-374.
Corander J., Waldmann P., Marttinen P. and Sillanpää M. J. 2004. BAPS 2: enhanced
possibilities for the analysis of genetic population structure. Bioinformatics 20, 2363-2369.
Cornuet J. M., Piry S., Luikart G., Estoup A. and Solignac M. 1999 New methods employing
multilocus genotypes to select or exclude populations as origins of individuals. Genetics
153, 1989-2000.
Cristescu R., Sherwin W. B., Handasyde K., Cahill V. and Cooper D. W. 2010 Detecting
bottlenecks using BOTTLENECK 1.2.02 in wild populations: the importance of the
microsatellite structure. Conserv. Genet. 11, 1043-1049.
Destro-Bisol G., Spedini G. and Pascali V. L. 2000 Application of different genetic distance
methods to microsatellite data. Hum. Genet. 106, 130-132.
Diniz-Filho J. A. F., de Campos-Telles M. P., Bonatto S. L., Eizirik E., de Freitas T. R. O., de
Marco P. Jr., Santos F. R., Sole-Cava A. and Soares T. N. 2008 Mapping the evolutionary
twilight zone: molecular markers, populations and geography. J. Biogeogr. 35, 753-763.
Di Rienzo A., Peterson A. C., Garza J. C., Valdes A. M., Slatkin M. and Freimer N. B. 1994
Mutational processes of simple-sequence repeat loci in human populations. Proc. Natl.
Acad. Sci. 91, 3166–3170.
Dupanloup I., Schneider S. and Excoffier L. 2002 A simulated annealing approach to define the
genetic structure of populations. Mol. Ecol. 11, 2571-2581.
Durand E., Jay F., Gaggiotti O. E. and François O. 2009 Spatial Inference of Admixture
Proportions and Secondary Contact Zones. Mol. Biol. Evol. 26, 1963-1973.
Evanno G., Regnaut S. and Goudet J. 2005 Detecting the number of clusters of individuals using
the software STRUCTURE: a simulation study. Mol. Ecol. 14, 2611-2620.
Excoffier, L., Smouse P. E., and Quattro J. M. 1992 Analysis of molecular variance inferred from
metric distances among DNA haplotypes: application to human mitochondrial DNA
restriction data. Genetics 131, 479-491.
Falush D., Stephens M. and Pritchard J. K. 2003 Inference of Population Structure Using
Multilocus Genotype Data: Linked Loci and Correlated Allele Frequencies. Genetics 164,
1567-1587.
Favre L., Balloux F., Goudet J. and Perrin N. 1997 Female-biased dispersal in the monogamous
mammal Crocidura russula: evidence from field data and microsatellite patterns. Proc. Roy.
Soc. Lond. B. 264, 127-132.
Felsenstein J. 2004 Inferring Phylogenies. Sinauer Associates. Sunderland, MA.
Ficetola G. F., Bonin A. and Miaud C. 2008 Population genetics reveals origin and number of
founders in a biological invasion. Mol. Ecol. 17, 773-782.
Fontaine M. C., Baird S. J. E., Piry S., et al. 2007 Rise of oceanographic barriers in continuous
populations of a cetacean: the genetic structure of harbor porpoises in Old World waters.
BMC Biology 5, 30-46.
Fontaine M. C., Tolley K. A., Michaux J. R., et al. 2010 Genetic and historic evidence of climate-
driven population fragmentation in a top cetacean predator: the harbor porpoises in
European water. Proc. R. Soc. B. 277, 2829-2837.
François O. and Durand E. 2010 Spatially explicit Bayesian clustering models in population
genetics. Mol. Ecol. Res. 10, 773-784.
Frankham R., Ballou J. D. and Briscoe D. A. 2002 Introduction to Conservation Genetics.
Cambridge University Press, Cambridge, UK.
Frankham R. 2010 Where are we in conservation genetics and where do we need to go?
Conserv. Genet. 11, 661-663.
Funk W. C., Forsman E. D., Johnson M., Mullins T. D. and Haig S. M. 2010 Evidence for recent
population bottlenecks in northern spotted owls (Strix occidentalis caurina). Conserv. Genet.
11, 1013-1021.
Garrick R. C., Caccone A. and Sunnucks P. 2010 Inference of Population History by Coupling
Exploratory and Model-Driven Phylogeographic Analyses. Int. J. Mol. Sci. 11, 1190-1227.
Garza J. C. and Williamson E. G. 2001 Detection of reduction in population size using data from
microsatellite loci. Mol. Ecol. 10, 305–318.
Goldstein D. B., Ruiz Linares A., Feldman M. and Cavalli-Sforza L. L. 1995a An evaluation of
genetic distances for use with microsatellite loci. Genetics 139, 463-471.
Goldstein D. B., Ruiz Linares A., Cavalli-Sforza L. L. and Feldman M. W. 1995b Genetic
absolute dating based on microsatellites and the origin of modern humans. Proc. Natl. Acad.
Sci. 92, 6723-6727.
Goldstein D. B. and Schlötterer C. 1999 Microsatellites: Evolution and Applications. Oxford
University Press. New York.
Gottelli D., Sillero-Zubiri C., Applebaum G. D., Roy M. S., Girman D. J., García-Moreno J.,
Ostrander E. A. and Wayne R. K. 1994 Molecular genetics of the most endangered canid:
the Ethiopian wolf Canis simensis. Mol. Ecol. 3, 301-312.
Goudet J., Perrin N. and Waser P. 2002 Test for sex-biased dispersal using bi-parentally
inherited genetic markers. Mol. Ecol. 11, 1103-1114.
Guillot G., Estoup A., Mortier F., Cosson