DNA to the rescue — Reflections after the session

Last Saturday, I had the pleasure of speaking with a group of family historians about something that has quietly transformed genealogical research over the past decade: the use of DNA evidence to break through long-standing brick walls.

The session was titled DNA to the rescue. That might sound slightly dramatic, but for many researchers it genuinely feels that way.

For generations, family historians worked almost entirely with documentary records. Civil registration — births, deaths and marriages — together with parish registers, census returns, probate files, land records, newspapers and local histories formed the backbone of reconstructing earlier families.

But the search rarely began in the archives.

More often it began at the kitchen table — with stories told by older relatives, a name written carefully in the front of a family Bible, a bundle of letters tied with ribbon, or a photograph whose faces were half remembered but never quite forgotten. Sometimes it was a diary, a certificate tucked away in a drawer, or some small memento that had travelled quietly through the generations.

These fragments of memory did not always provide clear answers. But they often supplied the first clues — a place name, a relationship, a family rumour — that guided the search into the documentary record.

Piece by piece, patient researchers learned to combine these different strands of evidence. Stories suggested possibilities. Documents confirmed or corrected them. Gradually, the outlines of earlier families began to take shape.

For a long time, this was the only landscape genealogists had to work within.

Today, however, another source of evidence has entered the picture.

DNA testing does not replace the stories, the documents, or the careful reconstruction of families in the records. But it adds something earlier researchers never had: a way of detecting genetic connections between living descendants and using those connections to guide the search through the historical record.

In that sense, DNA has not replaced traditional genealogy.

It has simply added a new layer to the landscape in which the search takes place.

When these different strands of evidence are brought together — family memory, historical records, and genetic connections — remarkable family stories can begin to emerge.

But even then, genealogists encounter a familiar problem.

Sometimes the records simply stop answering the question.

At other times they produce the opposite difficulty — not too little information, but too much.


The genealogical needle in a haystack

One way of describing traditional genealogical research is to say that we are searching for a needle somewhere in a field of haystacks.

When we begin a difficult problem we may be faced with:

  • several individuals with the same name

  • neighbouring parishes where the family might belong

  • incomplete or missing records

  • and very little evidence to distinguish one candidate from another.

At that stage every possibility can appear equally plausible.

This is where many genealogical brick walls form. The records may exist — but the number of potential candidates is simply too large.


What DNA actually changes

During the session I tried to emphasise something that is sometimes lost in discussions about DNA.

DNA does not replace traditional research.

What it does is reorganise the search space.

When someone takes an autosomal DNA test — the most commonly used type of DNA test today — they receive a list of people who share measurable segments of DNA with them. Most of these matches arise because the two individuals have inherited DNA from a shared ancestor somewhere in the past.

In most cases this shared ancestry is genuine, but it is not absolutely guaranteed. Small segments can sometimes appear as false matches, and in some populations DNA may be shared because it has circulated repeatedly within a long-standing regional or community population, rather than deriving from a single identifiable ancestor within the last few centuries.

For that reason, each match should be understood not as proof on its own, but as evidence pointing toward a likely genealogical connection that must be interpreted alongside documentary research.

At first glance these match lists can look overwhelming.

But once we begin to examine shared matches, something interesting starts to happen.

Matches begin to cluster together.

Groups of matches appear who all match one another as well as matching the tester.

These clusters often correspond to real historical families or ancestral lines. Individually, each match represents only a piece of evidence. Taken together, however, the patterns they form can point with increasing strength toward a particular family network in the past.

And at that point the search space begins to shrink.

Instead of searching every haystack in the field, we begin to see which haystack is most likely to contain the needle.

DNA rarely finds the needle for us — but it shows us where the needle is most likely to be hiding.


Where to begin: a practical testing strategy

Once people understand how DNA can help, the next question is usually where to begin.

While several DNA testing companies now exist, my own experience — and that of many other genealogists — is that it is usually most effective to start with AncestryDNA, particularly for Australians whose ancestry is predominantly British and Irish.

The reason is straightforward: AncestryDNA currently has by far the largest testing database, especially for people with heritage from the British Isles. A larger database simply means a greater chance of finding useful matches.

In Australia, the vast majority of DNA testers are at AncestryDNA. Current estimates suggest that more than 95% of testers are in the AncestryDNA database, while fewer than 20% of those testers have transferred their results or tested at other companies or at GEDmatch.

This has an important practical consequence.

If most testers are concentrated in one database, that is where the greatest amount of usable evidence is likely to be found. In many cases researchers can assemble a substantial proportion of their relevant DNA evidence there simply because the pool of testers is so large.

This does not make the analysis effortless — interpreting DNA matches still requires careful comparison, clustering, and traditional genealogical research — but a larger testing database greatly increases the chances that the necessary evidence will be present. Other testing platforms certainly have their strengths, and it is often useful to transfer results or test in multiple databases. Genealogists often describe this as “fishing in all the ponds.”

Many of the same fish will appear in more than one pond, because people sometimes test with several companies or transfer their results between platforms. But some fish appear in only one pond, which is why searching across multiple databases can occasionally reveal matches that would otherwise remain unseen.

Even so, beginning with the largest pond usually provides the strongest starting point. Once the matches there have been explored, the other ponds can then be fished to see whether additional connections appear.


Making sense of the matches

Of course, obtaining DNA matches is only the beginning. The real challenge is understanding what those matches actually mean.

Once a pool of matches has been identified the next step is to look for patterns within those matches.

In recent years AncestryDNA has introduced several tools that make this process considerably easier.

One of the simplest but most powerful tools available to researchers is the ability to organise matches using Groups, which allows testers to tag matches using colour coding. Over time this makes it possible to build clearly defined sets of matches associated with particular family lines.

To explore these groups more deeply, I find Enhanced Shared Matching (ESM) — available through Ancestry’s ProTools suite — particularly valuable.

Ancestry’s standard Shared Matches feature already shows which matches share DNA with both the tester and another match within Ancestry’s reporting thresholds. Enhanced Shared Matching builds on this by showing how much DNA those shared matches share with one another, providing additional context that can help researchers interpret the structure of a network of matches.

This additional information can be especially helpful when examining clusters of more distant matches. While the expanded visibility increases the number of potential connections that can be observed, it also makes careful interpretation more important. By showing the amount of DNA shared between matches, ESM can offer useful clues about how those individuals may relate to one another within the broader family network.

ProTools also includes a Clusters tool that attempts to group genetically connected matches automatically. Some researchers find this useful as an exploratory starting point. In my own work, however, I tend to rely primarily on Enhanced Shared Matching and carefully constructed match groups. Once matches have been systematically organised in this way, automated clusters often add relatively little additional insight, and can lead inexperienced researchers toward incorrect conclusions.

Used thoughtfully, these tools make it much easier to visualise the relationships between matches and to recognise where new matches may fit within the broader genetic landscape.


Tools beyond the testing platforms

Alongside the tools provided within the testing platforms themselves, a number of third-party resources have become extremely valuable for genealogists working with DNA evidence.

One of the most widely used is DNA Painter. While AncestryDNA does not provide chromosome-level segment data that would allow direct segment mapping, DNA Painter remains highly valuable for analytical tools such as What Are The Odds? (WATO) and WATO+. These tools allow researchers to test competing hypotheses about how a DNA match may fit within a reconstructed pedigree by comparing the amount of DNA shared with the amounts expected for different possible relationships.

In practice, WATO is also extremely useful for constructing clear descendant charts from a common ancestral couple. Even when formal hypothesis testing is not required, these visualisations provide a convenient way to organise descendants and position DNA matches within a structured family framework.

When used alongside resources such as the Shared cM Project, these tools allow researchers to evaluate whether proposed relationships are statistically plausible and to compare alternative placements of a match within a family tree.

Another helpful resource is Genealogy Assistant, a browser extension that enhances popular genealogy websites such as Ancestry, MyHeritage, FamilySearch, and FamilyTreeDNA. It integrates more than one hundred additional functions designed to streamline common research tasks, improve navigation, and assist researchers in analysing records and DNA matches while working across multiple genealogy platforms.

More recently, many genealogists have begun experimenting with artificial intelligence tools such as ChatGPT as research assistants. Used thoughtfully, these tools can help organise complex information, explore alternative interpretations, and assist in structuring genealogical arguments. Like any research tool, they require careful oversight — but they are rapidly becoming part of the modern genealogist’s toolkit.

Used thoughtfully, these tools help transform large lists of DNA matches into structured patterns that guide researchers toward the most promising lines of enquiry.


The timeframe of autosomal DNA

Autosomal DNA is particularly powerful because it operates within a genealogically meaningful timeframe.

For most researchers, autosomal DNA is most effective within roughly the last five to seven generations—typically back to third- or fourth-great-grandparents. As testing databases continue to expand and analytical tools improve, this practical range is increasingly extending to fifth-great-grandparents and sometimes beyond, where patterns of shared matches and clustering can still reveal meaningful connections.

Within this timeframe, DNA matches are often numerous enough to form recognisable clusters of descendants from the same ancestral couples. These clusters allow researchers to reconstruct networks of related testers and increasingly identify great-great-grandparents and sometimes even earlier ancestors. The effectiveness of this approach is strengthened when older generations are tested, when multiple descendants from different branches of a family are included to provide broader genetic coverage, and when researchers are able to work with the match lists of several related testers so that matches below the reporting thresholds within platforms such as AncestryDNA can be incorporated into the analysis.

It is these intersecting clusters of shared matches—revealed through tools such as shared-match analysis and Enhanced Shared Matching—that allow genealogists to reconstruct family networks, identify the ancestral lines from which those matches descend, and in many cases overcome long-standing genealogical brick walls. In this way, clusters of shared matches do not by themselves reveal the answer, but they can narrow the search dramatically, guiding researchers toward the particular ancestral “haystack” in which the documentary needle is most likely to be found.

Beyond this range the signal gradually weakens as recombination breaks DNA segments into progressively smaller pieces with each generation. Even so, patterns of shared matches can still provide valuable clues when interpreted within the broader network of related testers. As research extends further back in time, however, the analysis increasingly benefits from platforms that provide a chromosome browser, allowing shared DNA segments to be examined directly.


The other three types of DNA

During the session we also touched on the other three types of inherited DNA that genealogists can use.

These are:

  • Y-DNA, which follows the direct paternal line

  • mitochondrial DNA (mtDNA), which follows the direct maternal line

  • X-DNA, which forms part of the autosomal test but follows a distinctive inheritance pattern.

Each operates on a different timescale and answers different questions.

Y-DNA and mitochondrial DNA can extend far deeper into the past — sometimes hundreds or even thousands of years — because they change much more slowly across generations.

However, they follow only single lines of descent.

Autosomal DNA, by contrast, samples ancestors across all branches of the pedigree within the recent genealogical timeframe.


Proving the pedigree

Because of this, autosomal DNA often plays a crucial role in establishing the pedigree first.

Before Y-DNA or mitochondrial DNA can be used effectively in a genealogical investigation, researchers need confidence that the documentary line connecting the modern tester to the historical ancestor is correct. Autosomal DNA provides the strongest framework for that verification. Clusters of matches drawn from multiple descendant lines can show whether a reconstructed pedigree behaves as expected genetically.

Once that foundation is secure, Y-DNA or mitochondrial DNA can sometimes be used to explore specific paternal or maternal questions in much greater depth.

In simple terms:

Autosomal DNA builds the structure of the tree.
Y-DNA and mitochondrial DNA allow us to examine particular branches of that tree more closely.

When these different forms of evidence are brought together, patterns begin to emerge within the DNA matches. Clusters of related descendants appear, and the search space begins to narrow. Instead of working across an entire landscape of possible families, the investigation gradually concentrates on a much smaller network of people whose descendants appear repeatedly within the DNA evidence.

This shift—from a wide field of possibilities to a much more concentrated group of families—is exactly what makes DNA so powerful in genealogical research.

A clear example of this process can be seen in my own ongoing investigation into the origins of my second great-grandfather, William Webb Wagg.


A practical example: the William Webb Wagg investigation

Much of what I spoke about in the session reflects the experience of my own ongoing investigation into the origins of my second great-grandfather, William Webb Wagg. The full investigation is documented in a series of posts available online (link).

When that research began, the initial question seemed deceptively simple:

Who was William Wagg, the man who married Sarah Turner in Sydney in 1853?

At that stage the documentary record provided only fragments. As additional records came to light, the question evolved. Evidence suggested that this man might in fact be William Wagg, transported to Van Diemen’s Land from Norfolk in 1845, but establishing his English origins proved far more difficult.

Norfolk in the late eighteenth and early nineteenth centuries contained several families bearing the surname Wegg and related surnames, spread across a cluster of neighbouring parishes. On the documentary record alone, more than one plausible candidate could be identified.

Complicating matters further was the fact that the surname itself had changed. In the Norfolk records the name appears consistently as Wegg, while in Australia it appears as Wagg. Recognising that these spellings likely referred to the same family opened the investigation to a much wider group of possible candidates within the Norfolk records.

At the same time, a professional researcher working in the 1970s had concluded that William Wagg had been born in Surrey and had arrived in Sydney as a young child with his family. That interpretation circulated widely and was subsequently adopted by many family historians, becoming the accepted explanation for William’s origins.

Yet when the surviving documentary evidence was examined closely, the situation proved far less certain. The Norfolk records contained several individuals whose circumstances could potentially fit the known facts, while the Surrey hypothesis rested on limited supporting evidence.

In effect, the documentary landscape presented a familiar genealogical problem: several plausible candidates, none of whom could be confirmed from the records alone.

It was precisely the kind of situation in which DNA evidence had the potential to reshape the investigation.

Autosomal DNA changed the nature of the problem.

Descendants of William Webb Wagg began testing. When their shared matches were examined, clusters of DNA matches appeared whose documented ancestry repeatedly traced back to the same small group of Norfolk families.

Again and again the same surnames and the same parishes appeared: Plumstead, Baconsthorpe, Hempstead, and neighbouring communities.

The DNA evidence did not immediately identify William’s parents. What it did was something equally important: it demonstrated that the Webb Wagg descendants belonged genetically within that particular Norfolk kin network.

Once the search space had narrowed to those interconnected families, the documentary reconstruction could proceed much more effectively. Parish registers, family relationships, and geographic continuity gradually revealed a coherent structure linking the Wegg and Broughton families across several generations.

In other words, the investigation followed exactly the pattern discussed in the session.

DNA did not replace the records.

But it showed me which haystack to search.

Once the search was focused in the right place, the historical evidence began to fall into place in a way that had previously seemed impossible.


A reminder about unexpected discoveries

One final point we discussed during the session is that DNA sometimes reveals unexpected family connections.

Studies across large DNA databases suggest that approximately one in four people who take a DNA test discover a misattributed parentage event somewhere between themselves and their great-grandparents.

In other words, there is roughly a 25% chance that at least one of the fourteen closest ancestors in our pedigree may not be the biological parent recorded in the documentary record.

This can feel confronting at first.

But it simply reflects the complexity of real human lives. Family history is rarely as tidy as the official records might suggest.

As we move further back in time, the number of ancestors in our pedigree increases rapidly with each generation. At the same time, the cumulative chance that one or more misattributed parentage events may have occurred somewhere in those lines also increases.

This has important implications for the use of lineage-specific DNA tests such as Y-DNA, which follow the paternal line from father to son. When the documentary pedigree is correct, Y-DNA can provide powerful confirmation of a paternal lineage. However, if a break has occurred somewhere in that line, the results may be less straightforward.

Y-DNA and mitochondrial DNA are generally most useful when there is already a reasonably well-documented pedigree to test. Without that framework they can be difficult to use as a discovery tool, because they follow only a single ancestral line. A close match can provide strong evidence that two testers share a common paternal or maternal ancestor, but if no match appears it is often impossible to determine where a break may have occurred—particularly if the pedigree of the testers has not yet been independently verified, or if the connection being explored lies beyond the timeframe where autosomal DNA can reliably assist.

Autosomal DNA, by contrast, reflects contributions from many different ancestral lines and is therefore far more effective as an exploratory tool. It is much better suited to identifying previously unknown connections and guiding researchers toward the families where further investigation should begin.

Autosomal DNA helps us find the family, while Y-DNA and mitochondrial DNA may help us confirm the line.

DNA therefore does not replace traditional genealogical research; rather, it adds another layer of evidence that helps us test, refine, and sometimes reconsider the family histories recorded in the documentary record.


Why DNA feels like a rescue

For many genealogists, the most exciting aspect of DNA research is that it reopens problems that once seemed permanently closed.

Brick walls that resisted years of documentary searching suddenly become approachable again.

Not because the records have changed.

But because we now have a new kind of evidence that helps us see the landscape differently.

Seen in that light, the Webb Wagg investigation illustrates very clearly how DNA reshapes genealogical research.

The documentary record still holds the story.

But the DNA evidence reveals the landscape in which that story sits.

Through clusters of shared matches, autosomal DNA pointed repeatedly to the same group of interconnected Norfolk families, dramatically reducing the range of possible explanations.

What had once looked like a field of equally plausible haystacks became a much smaller and more manageable search area.

The needle still had to be found through careful documentary work.

DNA does not usually find the needle for us — but it can show us which haystack to search in a field that once seemed impossibly vast.

Popular posts from this blog

MODULE 2: Grouping your AncestryDNA matches - The Leeds Method and my "twisted" Leeds Method

MODULE 1: Working with AncestryDNA (Starting to solve your DNA dilemmas!)

Click here to find the links to all my musings!