Data Profile

Published

May 8, 2025

Modified

May 17, 2025

LDI


Gold

  • rows/records: 1,310
  • distinct rows/records: 1,310
  • duplicate records present: FALSE
  • cols: 49
  • cols with 100% missing values: 0
  • date cols: 3 (award_date_range_start, award_date_range_end, as_of)
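
The headline counts above can be reproduced from the raw table in a few lines. What follows is a minimal Python sketch, not the code that generated this report (which appears to come from an R/skimr pipeline). The function name `profile_summary` and the sample records are hypothetical, and missing values are assumed to be represented as `None`.

```python
from datetime import date


def profile_summary(records):
    """Return headline counts for a list of dict records (one dict per row)."""
    columns = list(records[0].keys()) if records else []
    # Two rows count as duplicates only if every field matches exactly.
    # repr() keeps list-valued columns hashable.
    distinct = {tuple(repr(r.get(c)) for c in columns) for r in records}
    # Columns in which every value is missing (assumed to be None).
    all_missing = [c for c in columns if all(r.get(c) is None for r in records)]
    # Columns holding at least one date value.
    date_cols = [
        c for c in columns if any(isinstance(r.get(c), date) for r in records)
    ]
    return {
        "rows": len(records),
        "distinct_rows": len(distinct),
        "has_duplicates": len(distinct) < len(records),
        "cols": len(columns),
        "cols_all_missing": len(all_missing),
        "date_cols": len(date_cols),
    }


# Hypothetical two-row sample, for illustration only.
sample = [
    {"award_number_clean": "A1", "amount": 55481, "as_of": date(2025, 3, 8)},
    {"award_number_clean": "A2", "amount": None, "as_of": date(2025, 3, 8)},
]
summary = profile_summary(sample)
print(summary)
```

Comparing whole rows, rather than a single ID column, matches the "distinct rows/records" wording above. A separate ID-level check is reported at the end of this profile.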

Details: missing and distinct

Skim

Data summary
Name                       subset(df_tmp, select = s…
Number of rows             1310
Number of columns          49
_______________________
Column type frequency:
  character                39
  Date                     3
  list                     4
  numeric                  3
________________________
Group variables            None

Variable type: character

skim_variable n_missing complete_rate min max empty n_unique whitespace
award_number 0 1.00 11 56 0 1304 0
award_number_clean 0 1.00 7 16 0 1310 0
parent_award_id_piid 1308 0.00 13 13 0 2 0
status 0 1.00 4 8 0 3 0
award_type 0 1.00 5 8 0 2 0
name 0 1.00 8 212 0 1288 0
awardee 0 1.00 4 65 0 343 0
project_type 0 1.00 5 35 0 17 0
principal_investigator 29 0.98 5 28 0 911 0
awarding_agency_code 45 0.97 2 2 0 1 0
awarding_agency_name 45 0.97 23 23 0 1 0
awarding_sub_agency_code 45 0.97 4 4 0 1 0
awarding_sub_agency_name 45 0.97 23 23 0 1 0
recipient_uei 45 0.97 12 12 0 339 0
recipient_name 35 0.97 4 73 0 340 0
recipient_parent_uei 600 0.54 12 12 0 240 0
recipient_parent_name 341 0.74 4 64 0 254 0
recipient_country_code 0 1.00 3 3 0 2 0
recipient_country_name 0 1.00 6 13 0 2 0
recipient_county_name 5 1.00 3 25 0 161 0
recipient_city_name 2 1.00 4 22 0 216 0
recipient_state_code 1 1.00 2 2 0 50 0
recipient_state_name 1 1.00 4 24 0 50 0
recipient_zip_code 1 1.00 5 5 0 320 0
recipient_county_fips_code 5 1.00 5 5 0 171 0
recipient_state_fips_code 1 1.00 5 5 0 50 0
primary_place_of_performance_country_code 45 0.97 3 3 0 2 0
primary_place_of_performance_country_name 45 0.97 6 13 0 2 0
primary_place_of_performance_city_name 49 0.96 4 22 0 206 0
primary_place_of_performance_county_name 51 0.96 3 25 0 159 0
primary_place_of_performance_state_name 46 0.96 4 24 0 51 0
primary_place_of_performance_zip_code 46 0.96 5 5 0 313 0
primary_place_of_performance_county_fips_code 46 0.96 5 5 0 171 0
primary_place_of_performance_state_fips_code 46 0.96 5 5 0 51 0
usaspending_permalink 45 0.97 60 81 0 1265 0
recipient_congressional_district 122 0.91 4 8 0 163 0
recipient_congressional_member 122 0.91 7 30 0 163 0
primary_place_of_performance_congressional_district 180 0.86 4 8 0 159 0
primary_place_of_performance_congressional_member 180 0.86 7 30 0 159 0
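
The complete_rate column follows from n_missing and the row count: complete_rate = 1 - n_missing / 1310, shown to two decimals. That rounding is why recipient_county_name (5 missing) prints as 1.00 and parent_award_id_piid (1308 missing) prints as 0.00. A small Python sketch of the arithmetic, where the helper name `complete_rate` is illustrative:

```python
N_ROWS = 1310  # "Number of rows" from the data summary


def complete_rate(n_missing, n_rows=N_ROWS):
    """Share of non-missing values, formatted to two decimals as in the table."""
    return f"{1 - n_missing / n_rows:.2f}"


# n_missing values taken from the character table above.
for name, n_missing in [
    ("parent_award_id_piid", 1308),
    ("recipient_parent_uei", 600),
    ("recipient_county_name", 5),
]:
    print(name, complete_rate(n_missing))
```

When a column shows 1.00, read n_missing before treating it as fully populated.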

Variable type: Date

skim_variable n_missing complete_rate min max median n_unique
award_date_range_start 15 0.99 2016-01-01 2025-07-01 2020-07-01 159
award_date_range_end 15 0.99 2016-10-31 2029-09-30 2024-08-31 157
as_of 0 1.00 2025-03-08 2025-03-08 2025-03-08 1

Variable type: list

skim_variable n_missing complete_rate n_unique min_length max_length
office 0 1.00 7 1 5
program 0 1.00 28 1 2
program_topics 347 0.74 55 0 4
evaluation_topics 1297 0.01 8 0 3

Variable type: numeric

skim_variable n_missing complete_rate mean sd p0 p25 p50 p75 p100 hist
amount 0 1 1907142.67 2455245.1 55481 756700.8 1399620 2870246 45450138 ▇▁▁▁▁
award_year 0 1 2019.82 2.5 2016 2018.0 2020 2022 2024 ▇▇▅▇▆
state_population 3 1 14293808.50 11850218.5 578759 6171374.0 9295227 19867248 39521958 ▇▆▂▁▂

Duplicate IDs

[1] "No duplicate IDs"
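
A check of this kind can be sketched as follows. This is an illustrative Python version, not the report's own code. The ID column is assumed to be award_number_clean, since it is the only character column whose n_unique (1310) equals the row count. The function names and sample records are hypothetical.

```python
from collections import Counter


def duplicate_ids(records, id_field="award_number_clean"):
    """Return the sorted ID values that occur more than once."""
    counts = Counter(r[id_field] for r in records)
    return sorted(value for value, n in counts.items() if n > 1)


def duplicate_id_report(records, id_field="award_number_clean"):
    """Mirror the report's message when no ID is repeated."""
    dups = duplicate_ids(records, id_field)
    if not dups:
        return "No duplicate IDs"
    return "Duplicate IDs: " + ", ".join(dups)


# Hypothetical records, for illustration only.
clean = [{"award_number_clean": "A1"}, {"award_number_clean": "A2"}]
dirty = clean + [{"award_number_clean": "A1"}]

print(duplicate_id_report(clean))
print(duplicate_id_report(dirty))
```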