{"slug": "day-9-a-local-japanese-sentiment-ai-bert-read-8-years-of-a-line-chat-and-the-ups", "title": "[Day 9] A local Japanese sentiment AI (BERT) read 8 years of a LINE chat, and the ups and downs surfaced from numbers alone", "summary": "A developer used a local Japanese sentiment AI (BERT) to analyze 87,621 messages from an 8-year LINE chat history, running entirely on a DGX Spark to keep the private conversation data on-device. By scoring 66,329 text messages for tone and aggregating monthly message counts, the analysis revealed four distinct phases in the relationship's arc—ramp-up, an 8-month silence, a second peak, and a stable plateau—without the developer re-reading any message content. The tone data showed that the mood had already turned negative in the month before the silence began, suggesting the sentiment shift preceded the drop in communication volume.", "body_md": "Day 9. Today is less about model internals and more of a personal experiment: have a local AI analyze the entire chat history with one LINE friend. (LINE is the dominant messaging app in Japan.)\n\nWhen I exported it, 8 years were sitting there — from the very first message to today. It started, we talked a lot, it went quiet for a while, then picked up again. That whole arc is in there.\n\nBecause the content is what it is, nothing left my machine: everything ran locally on my DGX Spark.\n\nWhat I used: my home AI box (DGX Spark) + a Japanese sentiment model (for tone) + a bigger local model (to guess events from numbers).\n\nRe-reading 8 years of messages one by one isn't realistic. So instead of reading the content, I looked only at the \"shape\" of the conversation — when, how much, and in what tone we talked.\n\nConcretely:\n\nFrom message counts and tone alone, the 8-year arc came out clearly on a chart. Started, went quiet, came back — the flow was visible without me re-reading a thing.\n\n```\nLINE chat export (text)\n        │\n        ▼\n 1. Parse: split each message into {datetime, who, type, text}\n        │   (from here on, message text never leaves the machine)\n        ▼\n 2. Aggregate: monthly counts, time-of-day, reply gaps\n        │\n        ▼\n 3. Tone scoring: classify each of 66k messages pos/neu/neg\n        │\n        ▼\n 4. Turning-point detection: from sudden changes in the numbers\n        │   + also show ONLY the numbers to a bigger AI and ask it to guess\n        ▼\n 5. Answer check: compare against the real timeline\n```\n\nYou can export a LINE chat as text from the chat screen (\"send chat history\").\n\nData size:\n\n| Item | Value |\n|---|---|\n| Span | ~8 years 2 months |\n| Total messages | 87,621 |\n| Text messages | 66,329 |\n| Stickers | 15,605 |\n| Photos | 3,982 |\n\n15,605 stickers… that's a lot.\n\n| Step | Model | What it does | What it sees |\n|---|---|---|---|\n| 3. Tone | Japanese sentiment model (`koheiduck/bert-japanese-finetuned-sentiment` ) |\nscores each message pos/neu/neg | 66k message texts (scores averaged per month) |\n| 4. Turning points | a bigger local model (`Qwen2.5` 72B) |\nguesses \"what happened to these two?\" | only the per-month table of counts + tone scores (no conversation, no words) |\n\nBoth run locally on my own machine.\n\nThis chart is the highlight. Top: monthly message count. Bottom: tone (up = positive, down = negative). The x-axis is months since the conversation started. (Axis labels are in Japanese.)\n\nPlotted, it isn't a steady climb or a flat line — it splits cleanly into \"chapters\": ramp-up → an 8-month silence → a second peak → a stable plateau. Four phases, at a glance.\n\nTone has two peaks of about +0.6, around the start and around when things resumed (overall mean ≈ 0, slightly negative in the later years). The interesting part: in the month *before* the silence, tone had already dropped to −0.1. The mood dimmed before the volume did.\n\nThere are two dips into negative tone. The one before the silence was an \"omen.\" The other is the recent years — not an omen, but the effect of logistics-y messages (\"what time are you home?\") piling up.\n\n💡 Mini-note: how is \"tone\" turned into a number?\n\nThe scoring is done by a Japanese sentiment model. Roughly:\n\n- pre-trained on lots of Japanese text labeled positive / negative\n- judges with context, not just by spotting keywords\n- returns a probability of \"positive-ness\" / \"negative-ness\" per message\n- I used the difference as a per-message score\n\nA few actual judgments (short, name- and place-free one-liners):\n\n| Message | Verdict |\n|---|---|\n| 「楽しかったね！」 (that was fun!) | Positive |\n| 「これめちゃうまい」 (this is so good) | Positive |\n| 「おはようございます」 (good morning) | Neutral |\n| 「もうお家？」 (home already?) | Neutral |\n| 「全く集中できない」 (can't focus at all) | Negative |\n| 「それは悔しいな、、」 (that's frustrating…) | Negative |\n| (a long trip-planning message) | Neutral |\n| (a snappy one-liner sent in a huff) | Negative |\n\nPlain happy lines score positive; logistics (\"good morning\", \"home already?\") score neutral; tiredness or irritation scores negative. Even long, businesslike planning messages lean neutral.\n\nMessage density by weekday × hour (brighter = more).\n\nA clear concentration at 7–9 a.m.!\n\nFirst, the simple method: mechanically pick the points where message volume jumped or dropped, then check against the real timeline.\n\n| Real event | Auto-detected timing |\n|---|---|\n| When it started | exact match |\n| When it went quiet | exact match |\n| When it resumed | exact match |\n| When it got lively again | a few months off |\n| A big life milestone | hard to detect (barely shows in counts) |\n\nSharp volume changes were nailed. But \"a big life milestone\" got missed. So I showed the *same numbers* to the bigger local model and asked \"what happened?\" — and got back:\n\nRather than hunting for a single spike, it reads the whole sequence of numbers as a \"flow,\" so it could pick up even an event that barely moves the counts.\n\nCounts and tone were enough to see the 8-year shape. Silence marks the quiet stretch; a surge marks the resumption — straight off the chart.\n\nGiven only monthly numbers, the model inferred even a barely-visible event (\"something big around here\"), and it lined up with reality. It connects scattered points into one flow.\n\nThe slight negative lean in later years isn't about getting along badly. Logistics messages (\"what time are you home?\") just don't score high. Low score ≠ trouble. It isn't that sentiment analysis is poor — the scores need to be read together with context.\n\n`time<TAB>name<TAB>text`\n\n. Multi-line messages (4,987 of them) are merged back into the previous message.`koheiduck/bert-japanese-finetuned-sentiment`\n\n, a 3-class (pos / neu / neg) Japanese model.Weather forecasts say one temperature, but everyone *feels* it differently. Same degrees, different \"do I need a coat?\" So next I'm building my own personal \"weather officer\" AI: from past weather data, it'll tell me each morning something like \"coat + beanie today.\" Over the next 100 days I'll teach it my own sense of cold — the start of a longer project.", "url": "https://wpnews.pro/news/day-9-a-local-japanese-sentiment-ai-bert-read-8-years-of-a-line-chat-and-the-ups", "canonical_source": "https://dev.to/peppercorn_llm/day-9-a-local-japanese-sentiment-ai-bert-read-8-years-of-a-line-chat-and-the-ups-and-downs-4951", "published_at": "2026-05-29 22:39:10+00:00", "updated_at": "2026-05-29 22:42:27.360922+00:00", "lang": "en", "topics": ["natural-language-processing", "artificial-intelligence", "machine-learning", "ai-tools", "ai-research"], "entities": ["LINE", "DGX Spark", "BERT", "Japan"], "alternates": {"html": "https://wpnews.pro/news/day-9-a-local-japanese-sentiment-ai-bert-read-8-years-of-a-line-chat-and-the-ups", "markdown": "https://wpnews.pro/news/day-9-a-local-japanese-sentiment-ai-bert-read-8-years-of-a-line-chat-and-the-ups.md", "text": "https://wpnews.pro/news/day-9-a-local-japanese-sentiment-ai-bert-read-8-years-of-a-line-chat-and-the-ups.txt", "jsonld": "https://wpnews.pro/news/day-9-a-local-japanese-sentiment-ai-bert-read-8-years-of-a-line-chat-and-the-ups.jsonld"}}