Data Profile

Published

April 17, 2025

Modified

May 17, 2025

LDI


Silver

  • rows/ records: 2,445
  • distinct rows/ records: 2,445
  • there are duplicate records: FALSE
  • cols: 49
  • cols with 100% missing values: 0
  • date cols: 2

Details: missing and distinct

Skim

Data summary
Name subset(df_tmp, select = s…
Number of rows 2445
Number of columns 49
_______________________
Column type frequency:
character 39
Date 3
list 4
numeric 3
________________________
Group variables None

Variable type: character

skim_variable n_missing complete_rate min max empty n_unique whitespace
award_number 0 1.00 10 56 0 2436 0
award_number_clean 0 1.00 7 18 0 2445 0
parent_award_id_piid 2443 0.00 13 13 0 2 0
status 0 1.00 4 8 0 3 0
award_type 0 1.00 5 8 0 2 0
name 0 1.00 7 212 0 2412 0
awardee 0 1.00 4 65 0 364 0
project_type 7 1.00 5 35 0 19 0
principal_investigator 62 0.97 5 30 0 1406 0
awarding_agency_code 838 0.66 2 2 0 1 0
awarding_agency_name 838 0.66 23 23 0 1 0
awarding_sub_agency_code 838 0.66 4 4 0 1 0
awarding_sub_agency_name 838 0.66 23 23 0 1 0
recipient_uei 838 0.66 12 12 0 361 0
recipient_name 828 0.66 4 73 0 360 0
recipient_parent_uei 1424 0.42 12 12 0 270 0
recipient_parent_name 1154 0.53 4 73 0 276 0
recipient_country_code 0 1.00 3 3 0 3 0
recipient_country_name 0 1.00 6 13 0 3 0
recipient_county_name 30 0.99 3 25 0 170 0
recipient_city_name 4 1.00 4 22 0 227 0
recipient_state_code 3 1.00 2 2 0 50 0
recipient_state_name 3 1.00 4 24 0 50 0
recipient_zip_code 3 1.00 5 5 0 341 0
recipient_county_fips_code 30 0.99 5 5 0 182 0
recipient_state_fips_code 3 1.00 5 5 0 50 0
primary_place_of_performance_country_code 838 0.66 3 3 0 3 0
primary_place_of_performance_country_name 838 0.66 6 13 0 3 0
primary_place_of_performance_city_name 842 0.66 4 22 0 218 0
primary_place_of_performance_county_name 845 0.65 3 25 0 168 0
primary_place_of_performance_state_name 840 0.66 4 24 0 51 0
primary_place_of_performance_zip_code 840 0.66 5 5 0 334 0
primary_place_of_performance_county_fips_code 840 0.66 5 5 0 183 0
primary_place_of_performance_state_fips_code 840 0.66 5 5 0 51 0
usaspending_permalink 838 0.66 60 81 0 1607 0
recipient_congressional_district 225 0.91 4 8 0 164 0
recipient_congressional_member 225 0.91 7 30 0 164 0
primary_place_of_performance_congressional_district 1014 0.59 4 8 0 160 0
primary_place_of_performance_congressional_member 1014 0.59 7 30 0 160 0

Variable type: Date

skim_variable n_missing complete_rate min max median n_unique
award_date_range_start 608 0.75 15-10-01 2025-07-01 2018-08-01 281
award_date_range_end 608 0.75 2005-04-01 2029-09-30 2022-12-31 284
as_of 0 1.00 2025-03-08 2025-03-08 2025-03-08 1

Variable type: list

skim_variable n_missing complete_rate n_unique min_length max_length
office 0 1.00 7 1 5
program 0 1.00 35 1 2
program_topics 497 0.80 60 0 4
evaluation_topics 2427 0.01 9 0 3

Variable type: numeric

skim_variable n_missing complete_rate mean sd p0 p25 p50 p75 p100 hist
amount 0 1.00 2505372.06 20366627.68 0 851822 1446527 2763929 1000000000 ▇▁▁▁▁
award_year 18 0.99 2015.37 5.68 1999 2011 2016 2020 2024 ▁▃▅▇▇
state_population 24 0.99 13775481.66 11476487.79 568502 6003323 8882190 19589572 39521958 ▇▅▂▁▂

Duplicate IDs

[1] "No duplicate IDs"