Appendix C — Data extraction keys

Table C.1: Factor keys following EML format.
attributeName code definition
also_phys complex Not unequivocally true or false; see comments for additional information.
also_phys FALSE FALSE
also_phys TRUE TRUE
claw_regen complex Not unequivocally true or false; see comments for additional information.
claw_regen FALSE FALSE
claw_regen TRUE TRUE
code_avail complex Not unequivocally true or false; see comments for additional information.
code_avail FALSE FALSE
code_avail TRUE TRUE
coords_avail complex Not unequivocally true or false; see comments for additional information.
coords_avail FALSE FALSE
coords_avail TRUE TRUE
data_avail complex Not unequivocally true or false; see comments for additional information.
data_avail FALSE FALSE
data_avail TRUE TRUE
est_variance complex Not unequivocally true or false; see comments for additional information.
est_variance FALSE FALSE
est_variance TRUE TRUE
habitat brackish Lives in environments with intermediate or fluctuating salinities, including estuaries and mangroves
habitat fresh Lives in freshwater environments
habitat marine Lives in marine environments
habitat semi-terrestrial Lives partially on land (or in non-submerged trees) and partially in an aquatic environment
habitat terrestrial Lives on land
log_trans complex Not unequivocally true or false; see comments for additional information.
log_trans FALSE FALSE
log_trans TRUE TRUE
mgmt_context complex Not unequivocally true or false; see comments for additional information.
mgmt_context FALSE FALSE
mgmt_context TRUE TRUE
morph_v_phys conflict Morphometric SM50 was higher than some functional/physiological SM50 values and lower than others. Other complex situations that do not fall into one of the other four categories can be labeled as this and expanded upon in the comments column.
morph_v_phys higher Morphometric SM50 was higher than values provided for SM50 based on a measure of physiological or functional maturity.
morph_v_phys lower Morphometric SM50 was lower than values provided for SM50 based on a measure of physiological or functional maturity.
morph_v_phys none No comparison between morphometric SM50 and other maturity types was made.
morph_v_phys same Morphometric SM50 was roughly the same as values provided for SM50 based on a measure of physiological or functional maturity.
mult_models complex Not unequivocally true or false; see comments for additional information.
mult_models FALSE FALSE
mult_models TRUE TRUE
novel_stats complex Not unequivocally true or false; see comments for additional information.
novel_stats FALSE FALSE
novel_stats TRUE TRUE
reg_mention complex Not unequivocally true or false; see comments for additional information.
reg_mention FALSE FALSE
reg_mention TRUE TRUE
sex B Estimates made for both males and females, but separately and possibly using different allometric variables
sex C SM50 estimate made using combined measurements of the same allometric variables from both sexes
sex F Females only
sex M Males only
sex NS Not specified
spatial_var complex Not unequivocally true or false; see comments for additional information.
spatial_var FALSE FALSE
spatial_var TRUE TRUE
success complex Not unequivocally true or false; see comments for additional information.
success FALSE FALSE
success TRUE TRUE
time_var complex Not unequivocally true or false; see comments for additional information.
time_var FALSE FALSE
time_var TRUE TRUE
f_ab_slope; m_claw_slope; f_ab_int; m_claw_int inc Increased at maturity
f_ab_slope; m_claw_slope; f_ab_int; m_claw_int dec Decreased at maturity
f_ab_slope; m_claw_slope; f_ab_int; m_claw_int none No significant change at maturity
f_ab_slope; m_claw_slope; f_ab_int; m_claw_int tri Triphasic (2 distinct maturity-related transitions)
f_ab_slope; m_claw_slope; f_ab_int; m_claw_int NA Not discussed or not applicable for the modeling approach used
terminal_molt FALSE FALSE
terminal_molt TRUE TRUE
terminal_molt complex Not unequivocally true or false; see comments for additional information.
heterochely FALSE FALSE
heterochely TRUE TRUE
heterochely complex Not unequivocally true or false; see comments for additional information.
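
The factor levels in Table C.1 can be checked mechanically during extraction. The following is a minimal sketch, not part of the original protocol: it assumes each extracted record is held as a dictionary of strings (so TRUE and FALSE are the literal strings from the table), and the names `ALLOWED` and `invalid_fields` are hypothetical.

```python
# Allowed levels per attribute, transcribed from Table C.1.
# Assumption: values are stored as plain strings, exactly as coded in the table.
TRI_STATE = {"TRUE", "FALSE", "complex"}
TRI_STATE_ATTRS = [
    "also_phys", "claw_regen", "code_avail", "coords_avail", "data_avail",
    "est_variance", "log_trans", "mgmt_context", "mult_models", "novel_stats",
    "reg_mention", "spatial_var", "success", "time_var", "terminal_molt",
    "heterochely",
]
ALLOMETRY_ATTRS = ["f_ab_slope", "m_claw_slope", "f_ab_int", "m_claw_int"]

ALLOWED = {name: TRI_STATE for name in TRI_STATE_ATTRS}
ALLOWED.update(
    {name: {"inc", "dec", "none", "tri", "NA"} for name in ALLOMETRY_ATTRS}
)
ALLOWED["habitat"] = {
    "brackish", "fresh", "marine", "semi-terrestrial", "terrestrial",
}
ALLOWED["morph_v_phys"] = {"conflict", "higher", "lower", "none", "same"}
ALLOWED["sex"] = {"B", "C", "F", "M", "NS"}


def invalid_fields(record):
    """Return {attribute: value} for each coded field whose value is not a
    level listed in Table C.1. Attributes absent from ALLOWED (free text,
    numeric columns) are ignored."""
    return {
        name: value
        for name, value in record.items()
        if name in ALLOWED and value not in ALLOWED[name]
    }


# Made-up record for illustration: "both" is not a valid level of sex.
record = {"habitat": "marine", "sex": "both", "data_avail": "TRUE", "comments": "n/a"}
print(invalid_fields(record))  # {'sex': 'both'}
```

A record that passes returns an empty dictionary, so the check can be run over every row before the data are summarized.
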

Table C.2: Standardized measurement codes to be used during data extraction.
code definition
ca_w Carapace width, not otherwise specified.
long_ca_w Long carapace width: maximum carapace width including spines
short_ca_w Short carapace width: maximum carapace width excluding spines. Measurements of internal carapace width are assumed to be of this type.
ca_len Carapace length, not otherwise specified.
long_ca_len Long carapace length: distance from the tips of the two frontal teeth to the extreme posterior margin of the carapace, excluding curvature of the carapace.
short_ca_len Short carapace length: distance from the base of the two frontal teeth to the extreme posterior margin of the carapace, excluding curvature of the carapace.
fo_w Fronto-orbital width: distance between the outer angles of the orbits.
f_w Frontal width: distance across the portion of the carapace between but not including the orbital area.
ab_w Abdomen width, not otherwise specified.
ab_w2 Width of 2nd abdominal segment
ab_w3 Width of 3rd abdominal segment
ab_w4 Width of 4th abdominal segment
ab_w5 Width of 5th abdominal segment
ab_w6 Width of 6th abdominal segment
ab_w7 Width of 7th abdominal segment
ab_len Abdomen length.
bod_h Body height (depth): Distance between the highest part of the dorsal metagastric region and a point along the ventral median line just anterior to the flexed abdomen.
cheliped_len Cheliped length: Distance from the articulation of the coxa with the sternum to the most distal part of the propodus OR not otherwise specified.
cheliped_w Cheliped width, not otherwise specified.
cp_len Chela propodus (=manus) length. Distance from the tip of the propodus (sometimes tip of dactyl) to the articulation with the carpus. If not otherwise specified, chela/claw length is assumed to refer to this.
cp_w Chela propodus width (breadth). If not otherwise specified, chela/claw width is assumed to refer to this. Includes measurements referring to chela palm width/breadth.
cp_h Chela propodus height (depth). Distance between the long dorsal spine and the base of the propodus. Includes variables described as chela palm height or depth. If not otherwise specified, chela/claw height assumed to refer to this.
ch_palm_len Chela palm length. Diagonally across chela palm (as in Conan & Comeau, 1986) or not otherwise specified.
merus_len Merus length, variously defined. Wallace (1949): Measured on a diagonal from the most posterior dorsal point of articulation of merus and carpus to the most anterior dorsal margin of articulation between the merus and preceding segment, whereas Finney (1981) measured on a straight line, not diagonal.
merus_w Merus width, variously defined.
leg_len Leg length, variously defined. E.g., Kulmiye et al. (2006): from the tip of the dactyl to the proximal margin of ischium.
head_w Head width
head_len Head length
head_h Head height
bod_len Body length
ww Wet weight
dw Dry weight

Table C.3: Standardized model codes to be used during data extraction.
code | definition | lines_connect | parallel_ok | logistic_reg_included | notes
BSL | Broken-stick model with logistic transition | Yes | No | Yes | NA
CLASS | Separate linear regression lines fit to already-classified points | Varies | Varies | Varies | CLASS-INT: intersection of regression lines; CLASS-SEP: separate lines not specified to intersect; CLASS-LOG: if the two lines overlap, logistic regression was used to find the breakpoint
CLU | Clustering (without discriminant analysis) | NA | NA | Yes (indirectly) | Includes kmeans, pam, hkmeans, dbscan, hdbscan, mixture models, etc.
CM | Cubic | NA | NA | No | NA
DIS | Discriminant analysis | NA | NA | Yes (indirectly) | Includes LDA and QDA. Can be preceded by PCA (principal component analysis = principal factor analysis = major axis regression) and/or preceded by any clustering method.
DID | Distribution interval derived (DID) maturity cutline | No | NA | Yes (indirectly) | Richar & Foy (2022)
GAM | Generalized additive model or other form of smoothing splines | NA | NA | No | NA
INFL_PT | Based on inflection point or dividing line between frequency modes of a CH/CW ratio histogram | NA | NA | Yes (indirectly) | NA
INT | Intersecting | Yes | No | No, intersection is SMM | Two separate lines are plotted for known mature and known immature individuals (as in the Somerton method), and a t-test is used to compare the slopes of the lines. If the slopes differ, SM50 is taken to be the intersection of the two lines.
LM | Linear | NA | NA | No | NA
MIX | Mixture modeling | No | Yes | Yes (indirectly) | NA
PU | Piecewise linear, unclear/not specified | Varies | Varies | No | The description is either unclear or contradictory; for example, citing Somerton 1980b while describing TL_even
QM | Quadratic | NA | NA | No | NA
RATIO | Based on a change in CH/CW ratio using a method other than the inflection point method | NA | NA | Yes (indirectly) | NA
SR | Broken-stick/segmented regression | Yes | No | No, breakpoint is SMM | NA
TL | Two-line model, no transition (lines start/end at breakpoint) | No | Yes | No, breakpoint is SMM | TL_obs = iterates over observed values; TL_even = iterates over evenly spaced values between the lower and upper limits of the unknown range
TLL | Two-line model with logistic transition | Yes | Yes | Yes | NA
TLS | Two-line Somerton method (lines can overlap) | No | Yes | Yes (indirectly) | NA
TLS2 | Somerton method where lines are tested for a significant difference before classifying the unknown points | No | Yes | Yes (indirectly) | NA
VE (B) | Visual estimation of allometric breakpoint | Varies | Yes | No | NA
VE (D) | Visual estimation of discriminant function | NA | NA | Yes (indirectly) | NA
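
To make the INT entry concrete, the arithmetic of taking SM50 as the intersection of two regression lines can be sketched as follows. This is an illustration under stated assumptions rather than the procedure of any particular study: ordinary least squares is assumed for both lines, the t-test on slopes mentioned in the notes column is omitted, and the function names and data are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y on x; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def line_intersection_x(immature, mature):
    """x-coordinate where the immature and mature lines cross.

    Each argument is a (slope, intercept) pair. Setting
    m1*x + b1 = m2*x + b2 gives x = (b2 - b1) / (m1 - m2).
    Returns None when the lines are parallel.
    """
    (m1, b1), (m2, b2) = immature, mature
    if m1 == m2:
        return None
    return (b2 - b1) / (m1 - m2)


# Made-up measurements: carapace width (x) and chela height (y).
immature = fit_line([40, 50, 60], [8, 10, 12])    # y = 0.2x
mature = fit_line([80, 90, 100], [19, 24, 29])    # y = 0.5x - 21
print(line_intersection_x(immature, mature))      # about 70 for this toy data
```

The parallel-line guard mirrors the parallel_ok = No entry for INT: if the slopes do not differ, no intersection, and hence no SM50, is defined.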

C.1 Measurement Notes

  1. Prefixes: l_ = left, r_ = right, d_ = dominant, m_ = minor, pre_ = pre-molt, post_ = post-molt, o_ = original, reg_ = regenerated. Use dominant and minor instead of left/right where possible.
  2. “Claw” is assumed to mean the chela rather than the entire cheliped.
  3. Variables should only be recorded if they are used to estimate (or attempt to estimate) size at maturity. For example, crab weight may be recorded for general growth modeling but should not be listed as a variable unless changes in the relationship between weight and a different variable were used to estimate size at maturity.
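
The prefix convention in note 1 combines with the base codes of Table C.2, so a recorded variable such as d_cp_len can be split back into its parts. The sketch below is one possible way to do this; the function name `parse_measurement` is hypothetical, and allowing several prefixes to be stacked (for example pre_r_cp_h) is an assumption, since the notes do not say whether prefixes may be combined.

```python
# Prefix meanings from note 1 of the Measurement Notes.
PREFIXES = {
    "l": "left", "r": "right", "d": "dominant", "m": "minor",
    "pre": "pre-molt", "post": "post-molt", "o": "original",
    "reg": "regenerated",
}

# Base measurement codes transcribed from Table C.2.
BASE_CODES = {
    "ca_w", "long_ca_w", "short_ca_w", "ca_len", "long_ca_len",
    "short_ca_len", "fo_w", "f_w", "ab_w", "ab_w2", "ab_w3", "ab_w4",
    "ab_w5", "ab_w6", "ab_w7", "ab_len", "bod_h", "cheliped_len",
    "cheliped_w", "cp_len", "cp_w", "cp_h", "ch_palm_len", "merus_len",
    "merus_w", "leg_len", "head_w", "head_len", "head_h", "bod_len",
    "ww", "dw",
}


def parse_measurement(code):
    """Split a recorded variable into (list of prefix meanings, base code).

    Prefixes are peeled off one at a time until the remainder is a base
    code from Table C.2. Raises ValueError if that never happens.
    Assumption: prefixes may be stacked, e.g. 'pre_r_cp_h'.
    """
    qualifiers = []
    rest = code
    while rest not in BASE_CODES:
        head, sep, tail = rest.partition("_")
        if not sep or head not in PREFIXES:
            raise ValueError(f"unrecognized measurement code: {code!r}")
        qualifiers.append(PREFIXES[head])
        rest = tail
    return qualifiers, rest


print(parse_measurement("d_cp_len"))   # (['dominant'], 'cp_len')
print(parse_measurement("long_ca_w"))  # ([], 'long_ca_w')
```

Checking the whole string against the base codes first keeps codes such as long_ca_w and merus_len from being misread as carrying an l_ or m_ prefix.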