Hawkometer Methodology — How We Score Central Bank Speeches
The scoring engine, the phrase library, and the limitations finance journalists need to know about
What the Hawkometer measures
The Hawkometer answers one question: based on what a senior central bank official has said in their public appearances, where do they sit on the hawkish-dovish spectrum, and is that position drifting?
It is not a measure of:
- What rate they will vote for at the next meeting (that is what our probability and Taylor Rule tools are for).
- Their long-run reaction function.
- Their personal beliefs separate from the committee’s published guidance.
It is a measure of public communication tone, normalised across speakers and aggregated up to the committee level.
The scoring engine
Every speech, interview, testimony or press conference is run through a phrase scorer:
score(text) = clip(-10, +10,
sum over hawkish phrases of (weight * f(count))
- sum over dovish phrases of (weight * f(count)))
Where f(count) applies sub-linear scaling for repeated phrases (f(1) = 1, f(2) ≈ 1.69, f(3) ≈ 2.10). This prevents a speaker from gaming the score by repeating one phrase ten times, while still rewarding emphasis when a phrase is used more than once.
The clipping at +/- 10 ensures that no single speech, no matter how strongly worded, can dominate a committee aggregate.
The phrase library
The library is in two parts. Hawkish phrases push the score up; dovish phrases push it down. A representative sample is shown below; the full library is published as part of the data feed (hawkometer.json → phrase_library).
Hawkish phrases (selection)
| Phrase | Weight |
|---|---|
| inflation persistent | +2.5 |
| inflation remains elevated | +2.2 |
| further tightening | +2.4 |
| additional tightening | +2.4 |
| more work to do | +2.0 |
| premature to declare victory | +2.0 |
| not the time to cut | +1.9 |
| upside risks to inflation | +1.8 |
| inflation expectations rising | +1.8 |
| policy must remain restrictive | +1.7 |
| sticky inflation | +1.6 |
| overheating | +1.6 |
| restrictive stance is appropriate | +1.5 |
| vigilance / vigilant | +1.4 |
| hawkish hold | +1.4 |
Dovish phrases (selection)
| Phrase | Weight |
|---|---|
| appropriate to begin easing | -2.4 |
| scope to ease | -2.2 |
| disinflation is well advanced | -2.0 |
| conditions for easing | -2.0 |
| rate cuts on the table | -2.0 |
| dovish pivot | -2.0 |
| approaching neutral | -1.8 |
| close to neutral | -1.6 |
| policy is sufficiently restrictive | -1.6 |
| disinflation continues | -1.5 |
| labour market cooling | -1.4 |
| recession risk | -1.4 |
| downside risks | -1.4 |
| financial conditions have tightened | -1.2 |
| growth is slowing | -1.2 |
Some phrases are deliberately ambiguous — “data-dependent” is technically a process statement, but in current usage it skews mildly dovish, so it sits at -0.7. We document each judgement call rather than hide it.
From speeches to committee scores
Once individual speeches are scored, we aggregate in three layers:
1. Per-speaker rolling 90-day average
For each official we compute the simple average of their sentiment scores over the last 90 days. We also compute a 30-day average and a prior-60-day average so a shift indicator can be reported (last 30 days vs. the 60 days before that).
2. Per-bank, voter-weighted committee score
The committee score is the weighted average of each speaker’s 90-day score:
| Status | Weight |
|---|---|
| Voter at the next meeting | 1.0 |
| Non-voter (e.g. non-voting Fed regional president, observer) | 0.55 |
A non-voter is still part of the committee’s intellectual centre of gravity, so we don’t drop them entirely — but a voter’s voice counts roughly twice as much in the index, which matches how markets price these officials.
3. Cross-bank lean
Per-bank scores are translated into a verbal lean for human readers:
| Score range | Lean |
|---|---|
| ≥ +2.5 | hawkish |
| +1.0 to +2.5 | leaning hawkish |
| -1.0 to +1.0 | neutral |
| -2.5 to -1.0 | leaning dovish |
| ≤ -2.5 | dovish |
These are descriptive labels, not predictions about the next decision.
Limitations — read this before citing the index
The Hawkometer is a useful first cut, but it has real limitations. Anyone citing it in research or reporting should understand them.
Context blindness. The scorer matches phrases, not arguments. A speaker saying “the case for additional tightening has weakened materially” will, in the current implementation, register additional tightening as a hawkish phrase even though the surrounding sentence makes the meaning dovish. We mitigate this by aggregating across many phrases and many speeches; we have not yet attempted to handle negation explicitly. This is a known weakness.
Novel language. When an official invents a new turn of phrase that markets immediately interpret as hawkish or dovish, the index will not pick it up until we add the phrase to the library. We do a quarterly review and an emergency update after major regime-change speeches.
Hypotheticals and conditionals. “If inflation were to re-accelerate, additional tightening would be appropriate” reads more hawkish to the scorer than the speaker intended. Again, weight comes from many speeches, not from a single sentence — but readers should not over-interpret a single high reading from a single appearance.
Translation effects. ECB speakers, BoJ officials and SNB Board members frequently speak in languages other than English. Our scorer currently runs only on the English version of those remarks. When the official version is non-English, scoring is delayed until a translated transcript is published; for press conferences we use the simultaneous interpretation transcript. This introduces a small lag and a small translation bias.
No causal claim. The index does not claim that a hawkish reading causes anything. It is an organised summary of what officials have said. Use it alongside market-implied probabilities, the Taylor Rule analysis and the rate path tools.
Sample data. While the production scrapers ramp up, some entries on the site may be representative samples rather than direct transcripts. Each sample-derived speech is flagged with is_sample: true in the JSON feed so downstream consumers can filter them out if needed.
Update cadence
The pipeline runs daily on the Central Bank Watch backend. New speeches typically appear within a few hours of delivery once the production speech feeds are connected. The phrase library is reviewed at the end of each quarter and any time a major central bank materially changes its communication style.
Reproducibility
Every score is fully reproducible from the raw text and the phrase library. The complete dataset (speakers, speeches, aggregates and the full phrase library) is published as hawkometer.json in the site’s data feed. If you find a scoring decision you disagree with, you can reproduce the calculation by hand and tell us which phrase you would weight differently.
For methodology questions or to suggest a phrase to add or re-weight, see the about page.