BJS U.S. Department of Justice · Office of Justice Programs
Bureau of Justice Statistics
Office of Justice Programs seal

BJS home page

Contents of Homicide trends in the U.S.

Additional Information about the Data

Homicide trends in the U.S.

Weighting and Imputation Procedures for the 1976-2005 Cumulative Data File

by James Alan Fox, The Lipman Family Professor
      of Criminal Justice, Northeastern University
        and
      Marianne W. Zawitz, BJS Statistician

Most of the data used in Homicide Trends in the United States are from the FBI's Supplementary Homicide Reports (SHR) which provides detailed, incident-level data on nearly all murders and nonnegligent manslaughters in the United States. These reports include information on the month and year of an offense, on the reporting agency and its residential population, county and Metropolitan Statistical Area (MSA) codes, geographic division, and population group, on the age, race, and sex of victims and offenders, and on the victim-offender relationship, weapon use and circumstance of the crime. Except for some slight modification in 1980, the record layout and variable definitions in the SHR data have remained unchanged since 1976 when the reporting format underwent a major revision.

This document describes adjustments used in this analysis for handling missing data in the SHR that result from agency failure to file reports and incomplete records that are missing certain information about the incident, victim or offender.

Correcting for Missing Records

Law enforcement agencies voluntarily report both Uniform Crime Reports (UCR) summary data and SHR incident data to the FBI on a monthly basis. The offense data in the UCR includes counts of murder and nonnegligent manslaughter as well as seven other Index crimes. The number of murders and nonnegligent manslaughters is based on the number of victims. For the most part, each month agencies report a total number of offenses in each Index crime category. In some instances agencies may not report each month or may not report at all which results in missing data when aggregating to the State or national level. After imputing for missing data, the FBI publishes estimates for the Nation as a whole as well as for individual States for all Index offenses, including murder and nonnegligent manslaughter. (For additional information on the imputation methods used by the FBI, see "Bridging Gaps in Police Crime Data" by Michael Maltz, 10/99, NCJ 176365, http://www.ojp.usdoj.gov/bjs/abstract/bgpcd.htm.) These annual UCR State and national estimates of homicide volumes are used as benchmarks for assessing the completeness of the SHR data file and to adjust SHR victim or offender counts as needed.

Not all of the murders and nonnegligent manslaughters reported in the UCR are included in the SHR. The SHR file appears to be just over 90% complete, although, as shown in Table 1, the level of completeness of the SHR has generally diminished in recent years. To correct for missing SHR records, the national and regional totals have been benchmarked to the UCR estimates. Specifically, SHR records were weighted so that State and national total counts matched UCR estimates for murder and nonnegligent manslaughter. This weighting process assumes that the missing records are not systematically different from those available in the file. While the systematic exclusions for certain years of a State like Florida or a city like Washington, D.C. may cause some concern, the fact that missingness occurs in both large and small jurisdictions lends support for applying these benchmark weights.

Table 1. SHR Records Pertaining to Incident, Victim, and Offender Counts
  All Homicides
Murder & Nonnegligent Manslaughter
UCR
Benchmark
Weight
  SHR Data File
SHR Data File
UCR Estimated
Year Records Victims Incidents Victims Offenders Victims Offenders

1976 16,744 17,406 15,951 16,605 17,995 18,780 20,352 1.131
1977 17,825 18,586 17,277 18,032 19,228 19,120 20,388 1.060
1978 18,546 19,308 17,957 18,714 20,075 19,560 20,983 1.045
1979 20,576 21,417 19,756 20,591 22,247 21,460 23,186 1.042
1980 21,911 22,786 21,002 21,860 24,859 23,040 26,201 1.054
1981 20,152 20,931 19,284 20,053 21,794 22,520 24,475 1.123
1982 19,413 20,288 18,622 19,485 21,219 21,010 22,880 1.078
1983 18,690 19,426 17,954 18,673 20,315 19,310 21,008 1.034
1984 17,168 17,858 16,574 17,260 18,781 18,690 20,337 1.083
1985 17,348 18,131 16,763 17,545 18,831 18,980 20,371 1.082
1986 19,089 19,849 18,510 19,257 20,722 20,610 22,178 1.070
1987 17,747 18,509 17,205 17,963 19,396 20,100 21,703 1.119
1988 17,846 18,546 17,277 17,971 19,841 20,680 22,832 1.151
1989 18,812 19,588 18,184 18,952 20,898 21,500 23,708 1.134
1990 20,404 21,246 19,451 20,273 22,889 23,440 26,465 1.156
1991 21,817 22,656 20,863 21,676 24,807 24,700 28,268 1.140
1992 22,753 23,793 21,701 22,716 25,382 23,760 26,549 1.046
1993 23,320 24,336 22,175 23,180 26,116 24,530 27,637 1.058
1994 22,231 23,246 21,093 22,084 25,075 23,330 26,490 1.056
1995 20,099 21,193 19,154 20,232 22,667 21,610 24,211 1.068
1996 17,053 17,829 16,203 16,967 19,350 19,650 22,410 1.158
1997 15,929 16,726 15,052 15,836 17,918 18,210 20,604 1.150
1998 14,313 14,975 13,556 14,209 16,177 16,970 19,320 1.194
1999 12,792 13,511 12,299 13,011 14,587 15,522 17,402 1.193
2000 13,220 13,856 12,597 13,230 15,059 15,586 17,741 1.178
2001 14,239 17,695 13,369 14,080 15,949 16,037 18,166 1.140
2002 14,252 15,057 13,481 14,274 16,075 16,204 18,249 1.135
2003 14,361 15,232 13,584 14,436 16,083 16,528 18,414 1.145
2004 14,198 14,946 13,435 14,164 15,972 16,148 18,209 1.139
2005 14,772 15,573 14,089 14,881 17,052 16,692 19,127 1.122
1976-2005 537,620 564,499 514,418 538,210 597,359 594,277 659,862  

Correcting for Incomplete Records

Even for the 90% of SHR records that are available for analysis, certain variables have nontrivial rates of missingness. At one extreme, characteristics of an agency (e.g., region, population group) are always complete. Victim data, though sometimes missing, are absent at such a low rate that standard approaches for handling missing data (specifically, listwise deletion) hardly bias analyses of patterns and trends in victimization. Specifically, as shown in the top panel of Table 2, victim age is missing in 1.72% of homicides, race in 1.04% and sex in 0.14%. Overall, 2.49% are missing on at least one of these measures.

At the other extreme, the percentage of missing data is greater for homicide offenders than victims. A significant problem in using SHR data to analyze offender characteristics is the sizable and growing number of unsolved homicides contained in the data file. Overall, 26 percent of the SHR offender records describe the perpetrator as unknown (based on situation codes), and this percentage has grown from just under 20 percent in 1976 to nearly 30 percent by the mid-1990s. Even when the offender is known to the police, not all characteristics about the offender may be reported. Table 2 shows specifically that as many as 26.48% of offender records contain no information about the perpetrators, and 31.6% of records are not complete in terms of the assailant's age, race and sex.

Table 2. Patterns of Missingness in Victim and Offender Data in the Supplementary Homicide File

● Data available | ○ Data missing
  Characteristic    
  Age Race Sex Cases Percent
Victim File          
None missing 524,787 97.51%
Sex missing 51 0.01%
Race missing 4,008 0.74%
Race and sex missing 84 0.02%
Age missing 7,740 1.44%
Age and sex missing 19 0.00%
Age and race missing 936 0.17%
All missing 585 0.11%
    Total       538,210 100.00%

Offender File          
None missing 408,573 68.40%
Sex missing 37 0.01%
Race missing 2,942 0.49%
Race and sex missing 209 0.03%
Age missing 24,880 4.16%
Age and sex missing 141 0.02%
Age and race missing 2,382 0.40%
All missing 158,195 26.48%
   Total       597,359 100.00%
With complete victim data          
None missing 403,209 67.50%
Sex missing 33 0.01%
Race missing 1,557 0.26%
Race and sex missing 195 0.03%
Age missing 23,943 4.01%
Age and sex missing 134 0.02%
Age and race missing 2,194 0.37%
All missing 151,520 25.36%
With incomplete victim data          
None missing 5,364 0.90%
Sex missing 4 0.00%
Race missing 1,385 0.23%
Race and sex missing 14 0.00%
Age missing 937 0.16%
Age and sex missing 7 0.00%
Age and race missing 188 0.03%
All missing 6,675 1.12%
   Total       597,359 100.00%

Ignoring unsolved homicides and missing offender data seriously understates calculated rates of offending overall and by particular subgroups of the population, distorts trends over time among these same subgroups, and biases observed patterns of offending to the extent that the likelihood of missingness of offender data is associated with offender characteristics.

While it is not possible to determine directly whether case solution and thus missingness in offender data are associated with offender characteristics themselves, some indication about the pattern of missingness can be derived from examining the extent to which the likelihood of case solution is related to victim and incident variables. As shown in Table 3, case solution rates are lowest for homicides against young adult victims as well as for elderly victims. Solution rates are also lower for incidents involving black or male victims. As expected, solution rates decrease with increasing population size and urbanness. In part as a consequence of urbanicity differences, solution rates in the South are much higher than other regions. Finally, whether or not a gun is used to commit the homicide does not appear to impact upon rates of case solution.

Table 3. Case Solution Rates by Victim, Location, and Incident Characteristics
    Solved Cases
Unsolved Cases
Variable Category Number Percent Number Percent
Victim Age 0-13 23,881 87.09% 3,541 12.91%
  14-17 20,766 70.94% 8,507 29.06%
  18-24 94,147 67.83% 44,643 32.17%
  25-34 115,389 69.13% 51,530 30.87%
  35-49 95,585 72.41% 36,417 27.59%
  50-64 38,604 71.49% 15,398 28.51%
  65+ 21,396 68.95% 9,634 31.05%
Victim Race White 214,126 72.69% 80,432 27.31%
  Black 186,740 68.53% 85,743 31.47%
  Other 8,902 71.81% 3,494 28.19%
Victim Sex Male 306,815 69.19% 136,617 30.81%
  Female 102,953 75.70% 33,053 24.30%
Location Large city 126,338 62.70% 75,166 37.30%
  Medium City 90,221 68.79% 40,934 31.21%
  Small city 52,078 77.82% 14,841 22.18%
  Suburban 89,997 74.37% 31,020 25.63%
  Rural 51,134 86.90% 7,708 13.10%
Region Northeast 61,936 63.17% 36,118 36.83%
  Midwest 77,558 69.44% 34,131 30.56%
  South 180,551 76.20% 56,379 23.80%
  West 89,723 67.58% 43,041 32.42%
Weapon Nongun 148,538 72.05% 57,629 27.95%
  Gun 261,230 69.98% 112,041 30.02%

A weighting strategy based on available information about the victims (age, race and sex) murdered in both solved and unsolved homicides is used to adjust for missing offender data in Homicide Trends in the United States. Through this imputation algorithm, the demographic characteristics of unidentified offenders are inferred on the basis of similar homicide cases--similar in terms of the victim and incident profile--that had been solved. In other words, offender profiles for unsolved crimes are estimated based on the offender profiles in solved cases matched on victim age, sex and race, region, urbanness, weapon and circumstances.

The weighting procedure is accomplished by establishing adjustment groups within a large, multidirectional cross tabulation. For each cell, we tally the number of offenders (Nc) and the number having complete offender data (nc), and use as a weight the inverse proportion of complete cases:

Formula - the weight is equal to the number of offenders divided by the number of offender cases with compete data

Next these adjustment cell weights are applied to the offender records based on their cell membership (i.e., based on victim age, race, sex, location type, weapon and circumstances) and whether or not the offender information is complete. That is,

Formula - the weight used is either the inverse proportion of complete cases (Wc) or zero if the offender data are incomplete

Finally, the weights are adjusted or "raked" post-stratification so that marginal weighted counts by year and State match fixed values.

This weighting approach is applied to the entire offender record so that cases with missing offender age, race or sex are excluded by virtue of their zero case weights. As a consequence, partial offender information is discarded, causing some slight inefficiency in the approach. While it would be possible to retain partial offender information, this would require separate weights for each offender characteristic. Finally, all non-zero weights are further increased slightly to account for the small percentage of cases unassigned to any adjustment cell because of their being missing on one or more victim characteristic.

In any analysis of weighted data, offenders with incomplete age, race or sex information are dropped due to assigned zero weights. Offenders with complete age, race and sex information all have weights at or above 1.0, and become proxies for excluded cases, matched on victim characteristics, State, and year. Thus, for example, an offender with an imputation weight of 2.0 would count in any analysis as if he/she were two offenders. The entire distribution of the weighting variable is shown in Table 4.

Table 4. Distribution of Imputation Weights
  Cases
Weight Number Percent
Exactly 0 194,111 32.49%
1.00 - 1.99 324,296 54.29%
2.00 - 2.99 61,374 10.27%
3.00 - 3.99 13,176 2.21%
4.00 - 4.99 3,357 0.56%
5.00 - 5.99 1,036 0.17%
6.00 - 6.99 4 0.00%
7.00 - 7.99 2 0.00%
8.00 + 3 0.00%
Total 597,359 100.00%
Mean 1.105  
Standard deviation 0.924  
Median 1.225  
75th %tile 1.597  
90th %tile 2.176  
95th %tile 2.655  
99th %tile 3.748  
Maximum 16.070  

Table 5 demonstrates the impact of applying the imputation weights by comparing the distribution of offender characteristics (age, race and sex) using the adjustment weights with those using listwise deletion of missing data. The approach boosts the abundance of youthful offenders; offenders under the age of 25 represent 47.5% of the imputed distribution, compared to 45.0% for the distribution using listwise deletion. The percentage of black offenders grows to 52.2%, compared to 51.2% without weighting. The sex distribution, which by any measure overwhelmingly favors male perpetrators, shifts slightly from 88.2% to 88.8% (clearly indicating a ceiling effect governing this percentage).

Table 5. Offender characteristics without and with imputation weights
    Without Imputation
With Imputation Weights
Offender Characteristics      Number Percent Adjusted Percent      Number Percent
Offender Age



     
  Under 14 2,203 0.33% 0.48% 3,101 0.47%
  14-17 44,107 6.68% 9.70% 68,599 10.40%
  18-24 158,416 24.01% 34.85% 241,414 36.59%
  25-34 130,924 19.84% 28.80% 187,683 28.44%
  35-49 84,013 12.73% 18.48% 113,931 17.27%
  50-64 26,065 3.95% 5.73% 33,861 5.13%
  65+ 8,798 1.33% 1.94% 11,272 1.71%
  Missing 205,336 31.12%      
  Total 659,862 100.00% 100.00% 659,862 100.00%
Offender Race          
  White 224,257 33.99% 46.83% 301,047 45.77%
  Black 244,994 37.13% 51.16% 344,313 52.18%
  Other 9,641 1.46% 2.01% 13,501 2.05%
  Missing 180,970 27.43%      
  Total 659,862 100.00% 100.00% 659,862 100.00%
Offender Sex          
  Male 427,221 64.74% 88.16% 585,856 88.78%
  Female 57,374 8.69% 11.84% 74,006 11.22%
  Missing 175,266 26.56%      
  Total 659,862 100.00% 100.00% 659,862 100.00%
BJS home page | Top of this page

Bureau of Justice Statistics
www.ojp.usdoj.gov/bjs/

Send comments to askbjs@usdoj.gov

 

OJP Freedom of Information Act page
Privacy Policy
Page last revised on July 11, 2007