Data Notification
Bird point IDs within grids were transposed resulting in inaccurate point-level spatial data
November 13, 2023
Grid point information in the Breeding landbird point counts data product (DP1.10003.001) have been incorrectly reported for two-thirds of grid points due to a transposition error when converting numeric point labels (1-9) into alphanumeric point IDs (A1-C3) during data ingest (see Figure 1 below that illustrates the transposition). This conversion was implemented to maintain the NEON alphanumeric data format while accommodating the request from the field observers to use the numeric system, but the conversion was unfortunately implemented incorrectly in the data ingest system. This applies to all years (2013-2022) at all 32 NEON sites where point counts are collected in 9-point grids (BARR, BART, BONA, CLBJ, CPER, DEJU, DSNY, GRSM, GUAN, HARV, HEAL, JERC, JORN, KONZ, MLBS, MOAB, NIWO, OAES, ONAQ, ORNL, OSBS, RMNP, SJER, SRER, STEI, TALL, TEAK, UKFS, UNDE, WOOD, WREF, YELL). Point counts at the remaining 15 sites are dispersed among distributed base plots and are not impacted.
Spatial data published in the brd_perpoint table are for the centroid of the grid only and are not impacted. These non-impacted data include latitude and longitude, National Land Cover Database (NLCD) vegetation class, and elevation. Spatial data associated with the individual points within the 9-point grids are accessed via the NEON API, the terrestrial sampling locations shapefile, or the geoNEON R package. The point-level spatial data are accurate on their own, but the mislabeling of the point IDs in the count data will have resulted in inaccuracies if the spatial and count data were joined. Users who have accessed grid point spatial data from these sources and joined it to the published count data should update their analyses using the corrected data product.
All transposed grid point ID values have been corrected as of 11/13/2023. As an additional measure to eliminate the possibility for incorrect transposition in the future, we have converted all alphanumeric point IDs to numeric point IDs that align with the grid labeling convention used in the field (Figure 2). The change will be reflected in Provisional data (2022 – 2023 data) starting in mid-November 2023; it will appear in the 2013 – 2021 data upon publication of the 2024 Release in January 2024. RELEASE-2021, -2022, and -2023 data contain the inaccurate alphanumeric point ID labels and will not change as releases are intended to be static.
Wherever possible, we encourage data users to use RELEASE-2024 and later to avoid this problem. If data from RELEASE-2021, -2022, or -2023 are used and point-level spatial resolution is needed, the following R code snippet can be used to convert the grid point ID values to the numeric convention now in use and correct for the transposition.
point_conversion <- data.frame(point=c("A1","A2","A3",
"B1","B2","B3","C1","C2","C3"),
point_corrected=c(1,4,7,2,5,8,3,6,9))
brd_perpoint$pointID <-
stringi::stri_replace_all_fixed(str=brd_perpoint$pointID,
pattern=point_conversion$point,
replacement=point_conversion$point_corrected,
vectorize_all=F)
brd_perpoint$eventID <-
stringi::stri_replace_all_regex(str=brd_perpoint$eventID,
pattern=point_conversion$point,
replacement=point_conversion$point_corrected,
vectorize_all=F)
brd_countdata$pointID <-
stringi::stri_replace_all_fixed(str=brd_countdata$pointID,
pattern=point_conversion$point,
replacement=point_conversion$point_corrected,
vectorize_all=F)
brd_countdata$eventID <-
stringi::stri_replace_all_regex(str=brd_countdata$eventID,
pattern=point_conversion$point,
replacement=point_conversion$point_corrected,
vectorize_all=F)
Please use the Contact Us form for further inquiries related to this issue; select DP1.10003.001 in the Data Product dropdown list and include the term "INC0069692".
Figure 1. (A) Bird point grid that was used for field data collection including numeric and alphanumeric point IDs, where, for example, point count data collected at numeric value 3 correspond to alphanumeric value A3 (upper right grid corner). Gray circles indicate the search area, and the green circle indicates the centroid of a collocated Distributed base plot, where other NEON data are collected. (B) Inaccurate bird point grid used upon data ingest prior to 2023 including numeric and alphanumeric point IDs, where, for example, point count data collected at numeric value 3 are inaccurately associated with spatial information for alphanumeric value C1 (lower left grid corner). Point IDs were transposed for all points except for A1, B2, and C3.
Figure 2. Updated bird grid point labels containing only numeric values. This labeling system will be applied in Provisional data starting in mid-November 2023 and in Release data in the upcoming 2024 data release.