Author: Denis Avetisyan
Researchers have shown that large language models can convincingly mimic the writing styles of 19th-century novelists with remarkably little guidance.

Fine-tuning with single-token prompts enables stylistic text generation and offers insights into model behavior through attention and gradient analysis.
While computational stylometry offers powerful tools for authorship analysis, generating and reliably evaluating stylistic imitation remains a significant challenge, particularly without paired training data. This paper, ‘Generation, Evaluation, and Explanation of Novelists’ Styles with Single-Token Prompts’, introduces a framework for both generating text in the voices of 19th-century novelists and assessing its stylistic fidelity using transformer-based detection and explainable AI techniques. Our findings demonstrate that large language models can be effectively fine-tuned with minimal prompting to capture distinctive authorial patterns, and that AI-driven evaluation offers a robust alternative to human judgment. What linguistic cues most powerfully define an author’s voice, and can these be systematically identified and replicated by artificial intelligence?
From Recognition to Generation: A History of Chasing Ghosts
Natural Language Processing has undergone a significant evolution in its approach to authorial style. Historically, the field centered on recognizing an author’s unique linguistic fingerprint – identifying patterns in vocabulary, syntax, and even punctuation to attribute text correctly. However, contemporary research now prioritizes the far more ambitious goal of generating text that convincingly mimics a specific author’s style. This shift demands a move beyond merely identifying existing patterns; it requires models to actively create new text imbued with the subtle nuances that define an author’s voice – a transformation from passive analysis to active creation, promising a future where machines can not only understand how someone writes, but also write like them.
The shift from identifying an author’s style to authentically reproducing it necessitates a departure from traditional natural language processing techniques. Earlier methods often relied on identifying frequent words or syntactic structures – a form of pattern matching – but these approaches prove insufficient for capturing the essence of individual writing. Nuanced stylistic imitation demands models capable of discerning subtle cues – idiosyncratic vocabulary choices, variations in sentence rhythm, and unique figurative language – that define an author’s voice. This requires a move towards more sophisticated computational linguistics, leveraging techniques like neural networks and large language models to understand and replicate the complex interplay of elements that constitute a distinctive style, moving beyond mere statistical correlations to a deeper comprehension of authorial intent and expression.
Replicating an author’s distinctive style isn’t merely about identifying frequently used words or sentence structures; it demands a far more sensitive computational approach. Previous natural language processing models struggled with this nuance, often producing text that felt generically ‘written’ rather than convincingly attributable to a specific voice. The challenge lies in capturing the subtle cues – the idiosyncratic rhythm, the preferred use of figurative language, the characteristic patterns of punctuation, and even the subtle variations in vocabulary choice that collectively define an author’s style. Modern models, however, are increasingly equipped to analyze these complex features, leveraging advanced techniques to discern patterns previously beyond the reach of algorithmic analysis and, crucially, to incorporate them into the text generation process.
The recent surge in computational stylometry and text generation owes a significant debt to the increasing availability of digitized literary works. Projects like Project Gutenberg, which offer free access to tens of thousands of ebooks, have created a previously unimaginable corpus for training and testing stylistic models. This wealth of data allows researchers to move beyond limited, curated datasets and explore the nuances of authorial voice across a broad spectrum of writing. The sheer volume of text enables algorithms to identify subtle patterns – from characteristic word choices and sentence structures to preferred punctuation and thematic concerns – that define an author’s unique style. Consequently, these digitized collections are not merely archives of literature; they are the very foundation upon which convincing stylistic imitation and text generation are now built, offering the raw material for machines to learn and replicate the art of writing.

GPT-Neo: A Decent Start, But Don’t Expect Miracles
GPT-Neo is an autoregressive language model utilizing a decoder-only transformer architecture. It was released in variants ranging from 125 million to 2.7 billion parameters, enabling it to generate coherent and contextually relevant text. As an autoregressive model, GPT-Neo predicts the next token in a sequence based on all preceding tokens, making it well-suited for text generation tasks. Its architecture allows for parallelization during training, improving efficiency. The model is pre-trained on the Pile, a large, diverse dataset comprising text from various sources, providing a broad understanding of language and enabling effective transfer learning to downstream tasks such as stylistic text generation. This pre-training establishes a strong foundational capability prior to task-specific fine-tuning.
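The autoregressive loop described above can be sketched concretely. In this minimal illustration a hypothetical bigram table stands in for the transformer; the token names, the author tag, and the probabilities are all invented, and a real model conditions on the full context rather than only the last token.

```python
# Toy sketch of autoregressive (next-token) decoding, the mechanism GPT-Neo
# uses. A hypothetical bigram table replaces the transformer for illustration.

def next_token_distribution(context, table):
    """Return a distribution over the next token.

    A real model would run the whole context through the decoder stack;
    here we simply look up the last token in a bigram table.
    """
    return table.get(context[-1], {"<eos>": 1.0})

def generate(prompt, table, max_new_tokens=5):
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        dist = next_token_distribution(tokens, table)
        nxt = max(dist, key=dist.get)  # greedy decoding: most probable token
        if nxt == "<eos>":
            break
        tokens.append(nxt)
    return tokens

# Invented table: after a single author tag, prefer style-typical words.
TABLE = {
    "<dickens>": {"it": 0.9, "the": 0.1},
    "it": {"was": 1.0},
    "was": {"the": 1.0},
    "the": {"best": 0.6, "worst": 0.4},
    "best": {"<eos>": 1.0},
}

print(generate(["<dickens>"], TABLE))
# → ['<dickens>', 'it', 'was', 'the', 'best']
```

The single-token author tag at the start of the prompt plays the same role as in the paper's setup: it is the only signal steering generation toward one author's patterns.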
Efficient fine-tuning techniques are essential for tailoring GPT-Neo to specific authorial styles. Full Fine-Tuning (FFT) updates all model parameters, offering maximum adaptability but requiring substantial computational resources and storage. Conversely, Low-Rank Adaptation (LoRA) freezes the pre-trained model weights and introduces trainable low-rank decomposition matrices, significantly reducing the number of trainable parameters – in the original LoRA experiments on GPT-3, by up to 10,000x – and thus lowering computational cost and memory requirements. LoRA achieves comparable performance to FFT in many cases while enabling faster training and easier deployment, making it particularly suitable for resource-constrained environments or large-scale author adaptation projects. Both methods use gradient descent to minimize the next-token prediction loss on the target author’s text.
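The LoRA arithmetic can be sketched directly. In this toy example (dimensions and values are illustrative, not GPT-Neo's actual sizes), the frozen weight `W` is augmented with a trainable low-rank update `B @ A`, and the forward pass computes `x @ (W + B @ A)` without ever merging the weights:

```python
# Minimal sketch of the LoRA decomposition: only B (d x r) and A (r x k)
# receive gradients; the pre-trained W (d x k) stays frozen.

def matmul(X, Y):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*Y)]
            for row in X]

def madd(X, Y):
    return [[a + b for a, b in zip(rx, ry)] for rx, ry in zip(X, Y)]

def lora_forward(x, W, B, A):
    """y = x @ (W + B @ A), computed without materializing the merged weight."""
    return madd(matmul(x, W), matmul(matmul(x, B), A))

# Trainable-parameter savings for a single 768x768 projection at rank r = 8:
d = k = 768
r = 8
full_params = d * k          # 589,824 weights updated by full fine-tuning
lora_params = d * r + r * k  # 12,288 weights updated by LoRA (~48x fewer)

# Tiny numeric check that the decomposed path matches the merged weight.
x = [[1.0, 2.0]]
W = [[1.0, 0.0], [0.0, 1.0]]  # frozen 2x2 identity
B = [[1.0], [0.0]]            # 2x1, rank-1 update
A = [[0.5, 0.5]]              # 1x2
merged = madd(W, matmul(B, A))
assert lora_forward(x, W, B, A) == matmul(x, merged)
```

The per-layer savings compound across the model, which is why LoRA adapters are small enough to store one per author.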
Fine-tuning GPT-Neo on the works of authors such as Dickens, Twain, and Austen yields a test set accuracy of 87%. This performance indicates the model’s ability to statistically capture and reproduce distinctive authorial styles. The process adjusts the pre-trained model’s parameters on a dataset of each author’s writing, enabling it to predict text sequences consistent with that author’s characteristic vocabulary, syntax, and phrasing. Accuracy is measured on held-out samples of each author’s work, as the frequency with which the model predicts the correct next word in a given context.
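The held-out evaluation described above can be sketched as a next-token accuracy computation. Here `toy` is an invented stand-in for the fine-tuned model, and the sample tokens are made up; this is the shape of the metric, not the paper's evaluation code.

```python
# Hedged sketch of next-token prediction accuracy on held-out text.

def accuracy(model_predict, held_out_tokens):
    """Fraction of positions where the model's next-token guess is correct."""
    correct = 0
    total = len(held_out_tokens) - 1
    for i in range(total):
        context = held_out_tokens[: i + 1]
        if model_predict(context) == held_out_tokens[i + 1]:
            correct += 1
    return correct / total

# Invented stand-in model: guesses "the" after "of", otherwise "and".
toy = lambda ctx: "the" if ctx[-1] == "of" else "and"
sample = ["tale", "of", "the", "best", "and", "worst"]
print(accuracy(toy, sample))  # → 0.4 (2 of 5 next tokens predicted correctly)
```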
Training generative models on unpaired data – text where direct authorial correspondence is absent – introduces significant challenges to stylistic adaptation. Unlike parallel corpora that facilitate direct mapping of style, unpaired data necessitates that the model infer stylistic characteristics from the data itself. This requires adaptation strategies capable of discerning subtle linguistic patterns without relying on explicit authorial signals. Approaches must account for variations in vocabulary, syntax, and thematic content that are not directly attributable to a specific author, potentially leading to stylistic drift or the generation of homogenized text. Robust techniques are therefore crucial to prevent the model from averaging out individual authorial voices and instead accurately capturing nuanced stylistic features present in the source material.

Unveiling the ‘How’: Explainable Stylistic Imitation, Because Black Boxes are Annoying
Explainable AI (XAI) techniques are utilized to deconstruct the mechanisms by which the model achieves stylistic imitation. These techniques move beyond simply observing that stylistic transfer occurs, and instead focus on identifying which aspects of the input data contribute to the generation of specific stylistic features. The application of XAI allows for an internal examination of the model’s decision-making process, providing insights into how input tokens are weighted and processed to manifest a target style in the output text. This detailed analysis is crucial for validating the model’s behavior and understanding the features it prioritizes during stylistic transformation.
Attention mechanisms and gradient-based analysis are utilized to determine the impact of specific input tokens on the generated stylistic output. Attention weights quantify the relationships between input and output tokens, highlighting which input elements the model focuses on during generation. Gradient-based methods, such as calculating the gradient of the output style with respect to the input tokens, reveal the degree to which each input token influences the generated style. Higher gradient magnitudes indicate a stronger influence. These techniques allow for the identification of key linguistic features within the input that are most responsible for shaping the stylistic characteristics of the generated text, providing insight into the model’s decision-making process regarding style transfer.
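A gradient-based token attribution of this kind can be sketched with finite differences on a toy score function. Here `score` is an invented, closed-form stand-in for the model's style logit (a sum of squared dot products with a hypothetical "style direction"); in the paper the gradients come from the transformer itself.

```python
# Hedged sketch of gradient-based attribution: per-token influence on a
# style score, estimated by finite differences over the token embeddings.

def score(embs, style):
    """Toy style score: sum of squared dot products with a style direction."""
    return sum(sum(e * s for e, s in zip(emb, style)) ** 2 for emb in embs)

def saliency(embs, style, eps=1e-6):
    """Per-token influence: summed |finite-difference gradient| of the score."""
    base = score(embs, style)
    out = []
    for t in range(len(embs)):
        g = 0.0
        for d in range(len(embs[t])):
            bumped = [list(e) for e in embs]
            bumped[t][d] += eps  # perturb one embedding dimension
            g += abs((score(bumped, style) - base) / eps)
        out.append(g)
    return out

style = [1.0, 0.0]               # hypothetical "author direction"
embs = [[2.0, 1.0], [0.1, 3.0]]  # two invented token embeddings
sal = saliency(embs, style)
# The first token aligns with the style direction, so its gradient
# magnitude dominates: higher saliency = stronger stylistic influence.
```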
Integrated Gradients (IG) analysis reveals the linguistic features responsible for stylistic imitation by attributing each input token’s contribution to the final prediction. Specifically, IG calculations demonstrate that attention mass dedicated to author tags increases as information propagates through deeper layers of the transformer model, reaching up to a 28% increase in later layers. This indicates that the model increasingly focuses on and utilizes author-specific cues embedded within the input to shape the generated stylistic output, effectively learning and replicating the author’s unique linguistic fingerprint.
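The IG computation itself can be sketched on a toy differentiable function. In the paper the path gradients come from the transformer; here `f` and `grad_f` are invented closed forms chosen so that the completeness axiom (attributions sum to `f(x) - f(baseline)`) is easy to verify.

```python
# Hedged sketch of Integrated Gradients:
#   IG_i = (x_i - x'_i) * average of df/dx_i along the straight-line path
#          from baseline x' to input x (midpoint Riemann sum).

def f(x):
    return x[0] ** 2 + 2 * x[1]

def grad_f(x):
    return [2 * x[0], 2.0]

def integrated_gradients(x, baseline, m=50):
    n = len(x)
    avg = [0.0] * n
    for k in range(m):
        alpha = (k + 0.5) / m  # midpoint of the k-th path segment
        point = [b + alpha * (xi - b) for xi, b in zip(x, baseline)]
        g = grad_f(point)
        for i in range(n):
            avg[i] += g[i] / m
    return [(xi - b) * a for xi, b, a in zip(x, baseline, avg)]

x, baseline = [3.0, 1.0], [0.0, 0.0]
attributions = integrated_gradients(x, baseline)
# Completeness axiom: attributions sum to f(x) - f(baseline) = 11.
assert abs(sum(attributions) - (f(x) - f(baseline))) < 1e-6
```

Applied to tokens, the same recipe attributes the style prediction across input positions, which is how the growing attention mass on author tags can be quantified.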
Stylistic fidelity of generated text is rigorously evaluated using a Transformer-Based Detector trained on a corpus of authentic writing samples. This detector functions as a classifier, identifying the author of a given sentence. Its prediction agrees with the author specified in the generator prompt 82% of the time, counting only predictions that exceed a pre-defined confidence threshold. This level of agreement indicates the generated text successfully replicates stylistic characteristics detectable by the detector, providing quantitative validation of the stylistic imitation process.
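The thresholded agreement metric can be sketched as follows; the author labels, confidence values, and cutoff are invented for illustration, not the paper's data.

```python
# Hedged sketch of detector-based evaluation: agreement between the prompted
# author and the detector's prediction, restricted to confident predictions.

def thresholded_agreement(triples, threshold=0.9):
    """triples: (prompted_author, predicted_author, confidence) tuples."""
    kept = [(p, q) for p, q, c in triples if c >= threshold]
    if not kept:
        return 0.0, 0
    agree = sum(p == q for p, q in kept)
    return agree / len(kept), len(kept)

# Invented toy results.
results = [
    ("dickens", "dickens", 0.97),
    ("austen",  "austen",  0.95),
    ("twain",   "dickens", 0.93),
    ("austen",  "twain",   0.40),  # below threshold, excluded from the metric
]
rate, n = thresholded_agreement(results)
# → rate = 2/3 over the 3 confident predictions
```

Restricting the metric to confident predictions trades coverage for reliability: low-confidence detector outputs are treated as abstentions rather than evidence either way.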
Expanding the Toolkit: Style Transfer and Future Directions – Because We Can’t Just Stop Here
The development of GPT-Neo established a crucial base for more sophisticated text manipulation techniques, notably content-preserving style transfer. This process moves beyond simple imitation; it allows for the disentanglement of what is said from how it is said. Researchers leverage the foundational capabilities of models like GPT-Neo to isolate and then reapply stylistic elements – vocabulary, sentence structure, tone – onto new content, effectively rewriting text in the voice of a chosen author or within a specific genre. This isn’t merely about swapping words; it involves maintaining the original meaning while authentically replicating the nuances of a target style, opening doors to applications like automated paraphrasing with stylistic control and the creation of text that harmonizes with a pre-defined aesthetic.
Beyond the capabilities of models like GPT-Neo, alternative architectures are rapidly advancing the field of controllable text generation. Generative Adversarial Networks (GANs) pit two neural networks against each other – a generator that creates text and a discriminator that evaluates its authenticity – leading to increasingly realistic and nuanced outputs. Simultaneously, Diffusion Models, inspired by physics, learn to reverse a process of gradual noise addition, allowing for the creation of text from pure randomness with remarkable control over style and content. These models offer complementary strengths; GANs excel at capturing intricate stylistic details, while Diffusion Models often demonstrate superior semantic coherence and stability, opening avenues for fine-grained manipulation of textual characteristics beyond simple imitation.
The ability to manipulate textual style extends far beyond mere imitation of historical authors. Content-preserving style transfer, and related generative techniques, unlock a spectrum of possibilities for crafting communication tailored to specific audiences and purposes. Imagine automated systems composing marketing materials with a consistently on-brand voice, generating personalized educational content adapting to a student’s learning style, or even assisting individuals in refining their own writing to achieve a desired tone – from formal and professional to casual and empathetic. Furthermore, these technologies offer creative avenues for authors and artists, providing tools to explore new narrative voices and experiment with different stylistic expressions, potentially revolutionizing content creation across diverse platforms and media.
Ongoing investigations are increasingly focused on the delicate balance between faithfully replicating a chosen writing style, maintaining the logical meaning of the generated text, and achieving a genuinely pleasing aesthetic experience for the reader. Researchers are probing how these three elements – stylistic fidelity, semantic coherence, and aesthetic quality – interact and potentially conflict during text generation. The goal isn’t simply to mimic surface-level features, but to understand how deeply ingrained stylistic choices impact the perceived meaning and artistic merit of a text. This involves developing new metrics to evaluate generated text beyond traditional measures of fluency and grammatical correctness, and exploring techniques that allow for nuanced control over these interwoven aspects of creative writing and communication.
The pursuit of capturing ‘style’ via large language models feels predictably Sisyphean. This paper details a method for mimicking 19th-century novelists, fine-tuning models with minimal prompts – a neat trick, certainly. However, the underlying assumption – that ‘style’ is a static, quantifiable thing – ignores the inevitable entropy of language. As Bertrand Russell observed, “The difficulty lies not so much in developing new ideas as in escaping from old ones.” The models may generate text resembling Austen or Dickens, but they cannot replicate the cultural context, the lived experience, or the slow evolution of language itself. It’s a temporary illusion, a sophisticated form of imitation. One anticipates the emergence of ‘style drift’ – the model inevitably diverging from its initial training, becoming something…else. The attention and gradient analysis offer a glimpse into the mechanism, but offer little protection against the relentless march of production finding a way to break the elegant theory.
The Road Ahead
The demonstration that stylistic mimicry can be achieved with such sparse prompting is…efficient. It raises the inevitable question of what constitutes ‘style’ if it can be distilled into a few gradient updates. The current work identifies how to generate, but sidesteps the more troublesome ‘why’ anyone would want a statistically convincing imitation of George Eliot at scale. One suspects the use cases will be…creative. The attention mechanisms, while offering a surface-level explanation, merely shift the problem to interpreting the model’s internal representations – a familiar loop for those acquainted with Explainable AI.
The real challenge, predictably, won’t be achieving higher fidelity, but managing the resulting chaos. Legacy stylometric features, carefully curated over decades, will become increasingly irrelevant as models learn to exploit – and then inevitably break – those very patterns. The fine-tuning approach, while elegant, offers limited control; a slight drift in the training data, a new corpus of text, and the carefully crafted ‘Eliot’ will begin to sound suspiciously like a chatbot with pretensions.
It’s a memory of better times, this focus on attribution and explanation. Soon enough, the goal will shift from understanding how a model writes like a novelist to simply preventing it from hallucinating legal disclaimers into the prose. The bugs, after all, are proof of life.
Original article: https://arxiv.org/pdf/2511.20459.pdf
Contact the author: https://www.linkedin.com/in/avetisyan/
2025-11-26 15:56