Oct 30, 2019 11 min read

Military Strength Methodology
U.S. Army Spc. Ian Kenny, a fire control specialist with 2nd Battalion, 4th Field Artillery Regiment, 75th Field Artillery Brigade, throws a RQ-11 Raven System into the sky. U.S. Army photo by Sgt. Dustin D. Biven

The assessment portion of the Index of U.S. Military Strength is composed of three major sections that address America’s military power, the environments within or through which it must operate, and threats to U.S. vital national interests.

The authors of this study used a five-category scoring system that ranged from “very poor” to “excellent” or “very weak” to “very strong” as appropriate to each topic. This particular approach was selected to capture meaningful gradations while avoiding the appearance that a high level of precision was possible given the nature of the issues and the information that was publicly available.

Some factors are quantitative and lend themselves to discrete measurement; others are very qualitative in nature and can be assessed only through an informed understanding of the material that leads to a judgment call. Further, conditions in each of the areas assessed are changing throughout the year, so any measurement is based on the information at hand and must necessarily be viewed as a snapshot in time. While this is not entirely satisfactory when it comes to reaching conclusions on the status of a given matter, especially the adequacy of military power, and will doubtless be quite unsatisfactory for some readers, we understand that senior officials in decision-making positions will never have a comprehensive set of inarguable hard data on which to base a decision.

Purely quantitative measures alone tell only part of the story when it comes to the relevance, utility, and effectiveness of hard power. In fact, using only quantitative metrics to assess military power or the nature of an operating environment can lead to misinformed conclusions. Raw numbers are a very important component, but they tell only a part of the story of war. Similarly, experience and demonstrated proficiency are often decisive factors in war, but they are nearly impossible to measure.

In assessing the global operating environment, this Index focused on three key regions—Europe, the Middle East, and Asia—because of their importance relative to U.S. vital security interests.

For threats to U.S. vital interests, the Index identifies the countries that pose the greatest current or potential threats to U.S. vital interests based on two overarching factors: the behavior and capability of those countries. The classic definition of “threat” considers the combination of intent and capability, but intent is not susceptible to clear measurement, so “observed behavior” is used as a reasonable surrogate because it is the clearest manifestation of intent. The selection of threat countries is based on their historical behavior and explicit policies or formal statements vis-à-vis U.S. interests, scoring them in two areas: the degree of provocative behavior that they exhibited during the year and their ability to pose a credible threat to U.S. interests irrespective of intent.

Finally, the status of U.S. military power is addressed in three areas: capability (or modernity), capacity, and readiness. All three are fundamental to success even if they are not de facto determinants of success (something we explain further in the section). Also addressed is the condition of the United States’ nuclear weapons capability, assessing it in areas that are unique to this military component and critical to understanding its real-world viability and effectiveness as a strategic deterrent.

Assessing the Global Operating Environment

Not all of the factors that characterize an operating environment are equal, but each contributes to the degree to which a particular operating environment is favorable or unfavorable to future U.S. military operations. Our assessment of the operating environment employed a five-point scale ranging from “very poor” to “excellent” conditions and covering four regional characteristics of greatest relevance to the conduct of military operations:

  1. Very Poor. Significant hurdles exist for military operations. Physical infrastructure is insufficient or nonexistent, and the region is politically unstable. The U.S. military is poorly placed or absent, and alliances are nonexistent or diffuse.
  2. Unfavorable. A challenging operating environment for military operations is marked by inadequate infrastructure, weak alliances, and recurring political instability. The U.S. military is inadequately placed in the region.
  3. Moderate. A neutral to moderately favorable operating environment is characterized by adequate infrastructure, a moderate alliance structure, and acceptable levels of regional political stability. The U.S. military is adequately placed.
  4. Favorable. A favorable operating environment includes good infrastructure, strong alliances, and a stable political environment. The U.S. military is well placed in the region for future operations.
  5. Excellent. An extremely favorable operating environment includes well-established and well-maintained infrastructure; strong, capable allies; and a stable political environment. The U.S. military is exceptionally well placed to defend U.S. interests.

The key regional characteristics consisted of:

  1. Alliances. Alliances are important for interoperability and collective defense as allies would be more likely to lend support to U.S. military operations. Various indicators provide insight into the strength or health of an alliance. These include whether the U.S. trains regularly with countries in the region, has good interoperability with the forces of an ally, and shares intelligence with nations in the region.
  2. Political Stability. Political stability brings predictability for military planners when considering such things as transit, basing, and overflight rights for U.S. military operations. The overall degree of political stability indicates whether U.S. military actions would be hindered or enabled and considers, for example, whether transfers of power in the region are generally peaceful and whether there have been any recent instances of political instability in the region.
  3. U.S. Military Positioning. Having military forces based or equipment and supplies staged in a region greatly facilitates the ability of the United States to respond to crises and, presumably, achieve successes in critical “first battles” more quickly. Being routinely present in a region also assists in maintaining familiarity with its characteristics and the various actors that might try to assist or thwart U.S. actions. With this in mind, we assessed whether or not the U.S. military was well positioned in the region. Again, indicators included bases, troop presence, prepositioned equipment, and recent examples of military operations (including training and humanitarian) launched from the region.
  4. Infrastructure. Modern, reliable, and suitable infrastructure is essential to military operations. Airfields, ports, rail lines, canals, and paved roads enable the U.S. to stage, launch, and logistically sustain combat operations. We combined expert knowledge of regions with publicly available information on critical infrastructure to arrive at our overall assessment of this metric.

Assessing Threats to U.S. Vital Interests

To make the threats identified herein measurable and relatable to the challenges of operating environments and adequacy of American military power, Index staff and outside reviewers evaluated separately the threats according to their level of provocation (i.e., their observed behavior) and their actual capability to pose a credible threat to U.S. interests on a scale of 1 to 5, with 1 representing a very high threat capability or level of belligerency. This scale corresponds to the tone of the five-point scales used to score the operating environment and military capabilities in that 1 is bad for U.S. interests and 5 is very favorable.

Based on these evaluations, provocative behavior was characterized according to five descending categories: benign (5); assertive (4); testing (3); aggressive (2); and hostile (1). Staff also characterized the capabilities of a threat actor according to five categories: marginal (5); aspirational (4); capable (3); gathering (2); and formidable (1). Those characterizations—behavior and capability—form two halves of the overall threat level.

Assessing U.S. Military Power

Also assessed is the adequacy of the United States’ defense posture as it pertains to a conventional understanding of “hard power,” defined as the ability of American military forces to engage and defeat an enemy’s forces in battle at a scale commensurate with the vital national interests of the U.S. The assessment draws on both quantitative and qualitative aspects of military forces, informed by an experience-based understanding of military operations and the expertise of the authors and internal and external reviewers.

It is important to note that military effectiveness is as much an art as it is a science. Specific military capabilities represented in weapons, platforms, and military units can be used individually to some effect. Practitioners of war, however, have learned that combining the tools of war in various ways and orchestrating their tactical employment in series or simultaneously can dramatically amplify the effectiveness of the force committed to battle.

The point is that a great number of factors make it possible for a military force to locate, close with, and destroy an enemy, but not many of them are easily measured. The scope of this specific project does not extend to analysis of everything that makes hard power possible; it focuses on the status of the hard power itself.

This Index assesses the state of military affairs for U.S. forces in three areas: capability, capacity, and readiness.

Capability. Capability is scored based on the current state of combat equipment. This involves four factors: the age of key platforms relative to their expected life span; whether the required capability is being met by legacy or modern equipment; the scope of improvement or replacement programs relative to the operational requirement; and the overall health and stability (financial and technological) of modernization programs.

This Index focused on primary combat units and combat platforms (such as tanks, ships, and airplanes) and elected not to include the array of system and component upgrades (such as a new radar, missile, or communications suite) that keep an older platform viable over time. New technologies grafted onto aging platforms ensure that U.S. military forces keep pace with technological innovations relevant to the modern battlefield, but at some point, the platforms themselves are no longer viable and must be replaced. Modernized sub-systems and components do not entirely substitute for aging platforms, and it is the platform itself that is usually the more challenging item to field. In this sense, primary combat platforms serve as representative measures of force modernity just as combat forces are a useful surrogate measure for the overall military that includes a range of support units, systems, and infrastructure.

In addition, it is assumed that modernization programs should replace current capacity at a one-to-one ratio. Less than a one-to-one replacement assumes risk, because even if the newer system is presumably better than the older, until it is proven in actual combat, having fewer systems lessens the capacity of the force, which is an important factor if combat against a peer competitor carries with it the likelihood of attrition. For modernization programs, only Major Defense Acquisition Programs (MDAPs) are scored.

The capability score uses a five-grade scale. Each service receives one capability score that is a non-weighted aggregate of scores for four categories: Age of Equipment, Modernity of Capability, Size of Modernization Program, and Health of Modernization Program. General criteria for the capability categories are as follows:

Age of Equipment

  • Very Weak: Equipment age is past 80 percent of expected life span.
  • Weak: Equipment age is 61 percent–80 percent of expected life span.
  • Marginal: Equipment age is 41 percent–60 percent of expected life span.
  • Strong: Equipment age is 21 percent–40 percent of expected life span.
  • Very Strong: Equipment age is 20 percent or less of expected life span.

Capability of Equipment

  • Very Weak: Majority (over 80 percent) of capability relies on legacy platforms.
  • Weak: 60 percent–79 percent of capability relies on legacy platforms.
  • Marginal: 40 percent–59 percent of capability relies on legacy platforms.
  • Strong: 20 percent–39 percent of capability relies on legacy platforms.
  • Very Strong: Less than 20 percent of capability relies on legacy platforms.

Size of Modernization Program

  • Very Weak: Modernization program is significantly too small or inappropriate to sustain current capability or program in place.
  • Weak: Modernization programs are smaller than current capability size.
  • Marginal: Modernization programs are appropriate to sustain current capability size.
  • Strong: Modernization programs will increase current capability size.
  • Very Strong: Modernization programs will vastly expand capability size.

Health of Modernization Program

  • Very Weak: Modernization programs facing significant problems; too far behind schedule (five-plus years); cannot replace current capability before retirement; lacking sufficient investment to advance; cost overruns including Nunn–McCurdy breach. (A Nunn–McCurdy breach occurs when the cost of a new item exceeds the most recently approved amount by 25 percent or more or if it exceeds the originally approved amount by 50 percent or more. See 10 U.S. Code § 2433, Unit Cost Reports.)
  • Weak: Facing procurement problems; behind schedule (three–five years); difficult to replace current equipment on time or insufficient funding; cost overruns enough to trigger an Acquisition Program Baseline (APB) breach.
  • Marginal: Facing few problems; behind schedule by one–two years but can replace equipment with some delay or experience some funding cuts; some cost growth but not within objectives.
  • Strong: Facing no procurement problems; can replace equipment with no delays; within cost estimates.
  • Very Strong: Performing better than DOD plans, including lower actual costs.

Capacity. To score capacity, the service’s size (be it end strength or number of platforms) is compared to the force size required to meet a simultaneous or nearly simultaneous two-war or two–major regional contingency (MRC) benchmark. This benchmark consists of the force needed to fight and win two MRCs and a 20 percent margin that serves as a strategic reserve. A strategic reserve is necessary because deployment of 100 percent of the force at any one time is highly unlikely. Not only do ongoing requirements like training or sustainment and maintenance of equipment make it infeasible for the entirety of the force to be available for deployment, but committing 100 percent of the force would leave no resources available to handle unexpected situations.

Thus, a “marginal” capacity score would exactly meet a two-MRC force size, a “strong” capacity score would equate to a 10-plus percent margin for strategic reserve, and a “very strong” score would equate to a 20 percent margin.

Capacity Score Definitions

  • Very Weak: 0 percent–37 percent of the two-MRC benchmark.
  • Weak: 38 percent–74 percent of the two-MRC benchmark.
  • Marginal: 75 percent–82 percent of the two-MRC benchmark.
  • Strong: 83 percent–91 percent of the two-MRC benchmark.
  • Very Strong: 92 percent–100 percent of the two-MRC benchmark.

Readiness. The readiness scores are from the military services’ own assessments of readiness based on their requirements. These are not comprehensive reviews of all readiness input factors, but rather rely on the public statements of the military services regarding the state of their readiness.

It should be noted that even a “strong” or “very strong” score does not indicate that 100 percent of the force is ready; it simply indicates that the service is meeting 100 percent of its own readiness requirements. Often, these requirements assume that a percentage of the military at any one time will not be fit for deployment. Because of this, even if readiness is graded as “strong” or “marginal,” there is still a gap in readiness that will have significant implications for immediate combat effectiveness and the ability to deploy quickly. Thus, anything short of meeting 100 percent of readiness requirements assumes risk and is therefore problematic.

Further, a service’s assessment of its readiness occurs within its size or capacity at that time and as dictated by the Defense Strategic Guidance, National Military Strategy, and related top-level documents generated by the Administration and senior Defense officials. It does not account for the size-related “readiness” of the force to meet national security requirements assessed as needed by this Index.

Thus, for a service to be assessed as “very strong” would mean that 80 percent–100 percent of the existing force in a service meets that service’s requirements for being “ready” even if the size of the service is less than the size required to meet the two-MRC benchmark. Therefore, it is important for the reader to keep this in mind when considering the actual readiness of the force to protect U.S. national security interests against the challenges presented by threats around the world.

Readiness Score Definitions

  • Very Weak: 0 percent–19 percent of service’s requirements.
  • Weak: 20 percent–39 percent of service’s requirements.
  • Marginal: 40 percent–59 percent of service’s requirements.
  • Strong: 60 percent–79 percent of service’s requirements.
  • Very Strong: 80 percent–100 percent of service’s requirements.