Created: 23 Feb 2015 | Modified: 30 Jun 2016 | BibTeX Entry | RIS Citation |
Authors: Mark E. Madsen and Carl P. Lipo
Experiment seriationct-1
is a first try at producing true frequency seriations from a regional cultural transmission model. The experiment combines:
The CT model involves:
The source code for the model is available in the seriationct repository, and is written in Python on top of the superb simuPOP population genetics simulation framework by Bo Peng. Data are stored in a standard MongoDB database, for ease of evolving the data schema as the experiment proceeds.
The CT model takes a interval-style “temporal network” model (Holme and Saramäki 2012), which describes an evolving contact network as a series of network “slices” that are valid for a duration of time. Vertices represent local populations, edges represent movement of individuals between populations, and edge weights represent the intensity of that movement and thus the potential for cultural traits to flow between subpopulations.
The overall model is composed of a series of GML files which describe weighted graphs, with textual labels uniquely identifying subpopulations that carry over (potentially) between slices. The GML files become a sequence of NetworkX
graph objects, assigned to time indices after the CT model has achieved a quasi-stationary equilibrium (“burn-in”) and before the end of the simulation run. At the time index that each slice is assigned:
The goals of seriationct-1
are twofold:
Does this combination of factors: regional, structured transmission, plus time averaged observations, plus classification of multiple trait dimensions into a design space of classes, yield the frequency patterns seen in archaeological seriations?
Given two different patterns of regional interaction, do the resulting seriations display distinct enough solution characteristics that we can identify the interaction pattern from seriation solutions?
The two regional interaction models examined here are:
For each model, we generate multiple random instances of graphs with the specified topology, so that results are independent of the particulars of a single graph realization. We then simulate cultural transmission on each temporal network instance, with replications.
For each simulation run, we gather class counts from each subpopulation existing at each simulation step after burn-in. For analysis, we aggregate all of the counts over time from each subpopulation, and then calculate the time-averaged frequency of classes. These form the “assemblages” which are input to the IDSS seriation algorithm.
The resulting output is an IDSS frequency seriation graph for each replication of each simulation run on each temporal network model.
The analysis then involves studying the graph properties of these seriation solutions, and determining whether a classifier model can accurately predict which regional interaction model generated a given seriation solution. The data set will be split 80/20 into a training and test set, balanced across the temporal network models and simulation parameters, and the ability to predict the interaction model assessed only on the test set.
Holme, Petter, and Jari Saramäki. 2012. “Temporal Networks.” Physics Reports 519 (3). Elsevier: 97–125.