Science
May 27, 20261
50%
Personal Data Analysis: Individual Examines 20 Years of Digital Communications to Map Friendships and Life Patterns

An individual analyzed 20 years of digital communications spanning 1.2 million messages across multiple platforms to understand their social relationships and emotional patterns, moving beyond traditional life-tracking to capture qualitative aspects of friendships and personal fulfillment.





Quick Facts
Who
Individual analyst (author not named)
What
Analyzed 20 years of personal digital messages
When
2000s-2020s (communication period)
Where
Post-Soviet space (VK usage)
- Analyzed 20 years of personal digital messages
- Extracted and structured 1.2 million messages
- Built a personal CRM system from communication archives
- Parsed data from five different social platforms
- Analyzed longest conversation thread of 486,000+ messages
An individual has conducted an extensive analysis of their 20-year digital communication history, extracting and structuring approximately 1.2 million messages from multiple platforms to better understand their social relationships and emotional patterns. The project was motivated by a desire to move beyond traditional life-tracking methods—such as Tim Urban's "Life in Weeks" grid—to capture the qualitative aspects of relationships and personal fulfillment rather than just major life events.
The analysis drew from five major data sources spanning different eras of online communication: ICQ, IRC, and DC++ from the 2000s; VK, Twitter, and Facebook from the 2010s; and Instagram and Telegram from the 2010s-2020s. Using GDPR data access requests and similar legal frameworks, the individual obtained complete archives including messages, reactions, and social graphs. The technical work involved parsing and normalizing data from multiple formats—JSON and HTML files—while managing platform-specific quirks such as Instagram's double-encoded Cyrillic characters, Telegram's variable message IDs across exports, and Facebook's E2E encryption creating duplicate entries across folders.
After consolidating the disparate data sources into a uniform format, the analysis revealed significant patterns in communication noise and substance. In the longest conversation thread—486,000+ messages exchanged over ten years with a single partner—only 58.7 percent constituted substantive text, while 41 percent consisted of fillers, emoji-only messages, links, and media. The project aimed to build a personal Customer Relationship Management (CRM) system grounded in actual communication records rather than memory, enabling identification of patterns in emotional bandwidth, friendship cycles, and relationship half-lives that would otherwise remain invisible to individual perception.
Topics
Why This Matters
This analysis demonstrates how individuals can leverage digital exhaust—the accumulated data of everyday online interactions—to gain unprecedented insight into social relationships and emotional patterns. Rather than relying on faulty memory or surface-level metrics, the project creates a data-driven foundation for understanding which relationships matter most and how emotional bandwidth evolves over decades. For knowledge workers and those interested in personal knowledge management, this methodology offers practical frameworks for meaningful self-reflection and data-driven life decisions.
Timeline & Sources
Jan 1, 2008
WireVK social network archives begin
Jan 1, 2014
WireTim Urban publishes 'Your Life in Weeks' on WaitButWhy, which inspired personal data tracking motivation
May 27, 2026
WirePublication of analysis findings on Hacker News