So Square Enix did it again... they sprang one of the best games of the last few years without any noise and made Paranormasight 2 (The Mermaid's Curse). Amid 7 years of nonstop RPG releases and remakes, these two titles stand out as something you'd never guess they'd do, especially since it's a limited budget game (there's only about 20 names in the credits, compare to the AAA size that usually involves hundreds of people).
What can I say? They nailed it again. If you remember I talked about paranormasight 1 a few months ago here, contrasting Famicom Detective Club to it.
Paranormasight 2 hits just as right - and it's no surprise, considering how closely it follows the formula. Yet even as I thought I knew how to watch out for the tricks the game would pull, I still found myself taken by surprise most of the time.
It's a game that engages you, the player, to participate in its story and not just consume it. It rewards your deductions even if they're wrong or incomplete yet, and then likes to recontextualize them - what you thought you knew for sure turns out to be something else, things like that. This hasn't changed from the first game, and the enigmatic Storyteller even makes a return to carry you through this story. Although I will say I did make some correct deductions far before the game needed me to, but it really didn't spoil or ruin anything.
And like I said in my review of the first game, this isn't a game about the supernatural. At its core, this is a story of human drama and correcting the errors of the past. It starts with a young man wanting to find a mermaid because he believes his mother, who has disappeared five years ago during a storm at sea, might have been one. But then it turns out there's also in folklore "fish-mermaids" and "human-mermaids" - and yes, the difference is important. It is said that eating a mermaid's flesh can grant anyone immortality (apparently this is true in Japanese folk stories), but actually it might only be 800 years and actually there might be other conditions as well, possibly. Maybe. Also there's dead bodies washing up on shore after an underwater pit opened up, and an unknown girl mysteriously arrived on the island where most of the game takes place just two months ago... also you should probably learn the Heike clan's lineage and some centuries later an island in the bay of Ise disappeared under the waves never to be seen again. I'm sure that's nothing though right?
And this is just the prologue, I'm not even spoiling anything. This is how the game starts you, and it's up to you to figure everything out piece by piece. It starts with a very strong supernatural/horror vibe (don't worry there are no jumpscares), but that's really not where the meat of the game is. I like this approach; the supernatural serves to sublimate and elevate the underlying story, not to pull a red herring like many other games do. In the first game, you the player learned that curses that could kill people were real because the prologue had you use them liberally. It left no doubt in your mind that you had to suspend disbelief and whatever the game was throwing at you was going to be the truth. In the second game, they expedited this a little and there is some dissonance in the beginning, where people who would have no reason to believe in the supernatural instantly believe you.
Still, the dialogue hits all the right notes. It will make you laugh, it will make you cry, it will make your heart palpitate. You can be having a completely normal conversation in one moment when it suddenly turns into a tense standoff.
The story chart has come back, this time with 4 pairs of protagonists instead of just 3. It was a bit harder to follow in the beginning especially with all the history you're suddenly fed with, but it mellows out as the game goes. This is a series that plays with what a game means, and although there is less of that in this sequel (the first game had you actually use video game mechanics to progress), the story chart remains relevant. But it did get confusing a little in the beginning trying to put events in their linear order, when you're jumping all around both in time and protagonists.
The game definitely strings you along exactly as it wants to, and you're just here for the ride. But you gotta trust that it will bring you where you need to be in due time. It's a beautiful story, much like the first game, and very bittersweet. I remember reading a while back we don't like finishing things because it means it has ended and we have to move on. We'd like things to remain in a metaphysical state, never budging, so that we don't have to say our goodbyes.
Mind you, I haven't actually finished the game yet - I decided to pace myself for this one, and play a couple hours a day. Judging from the steam reviews it seems to be around 12 hours long like the first one, and I'll reiterate what I said about the first one too: I am ready to play 12 hours more lol. But I guess this is what the game teaches you in a way, things can't last forever. Even immortals die eventually.
Well, a big theme of the games like I said is correcting the errors of the past, i.e. having agency to change things. Which I think is a powerful message to convey; too often protagonists are passive in the face of what happens around them, being carried by the waves without even attempting to swim against the current. So it's refreshing to have a story that says, it's never too late to make things right. This is why Paranormasight is about deeply human stories at the end of the day even if they rely heavily on the paranormal.
And that's where I am too. I'm in what I believe to be the final stretch of the game, and half of me wants to finish it as soon as possible and the other half wants to take my time so I don't have to move on too quickly. Mind you as a mostly visual novel-type game, you won't have a lot of gameplay either and it's a lot of reading, so it's perfectly fine to take breaks and chip slowly at it. Otherwise it is a lot of sitting there and pressing A to continue.
Oh one last thing. I don't think it has much bearing really but, the island they say is fictional is actually real and the pictures are clearly taken from there lol. Here's a reddit post: https://old.reddit.com/r/ParanormasightHonjo/comments/1rg2wkz/the_completely_fictional_kameshima_island/. I think it was more of a liability thing to rename it and treat it as fictional but I think it's also a caution against not treating all the historical elements the game tells you as true lol.
Oh if you have a PDF you can still OCR it with minerU, it will be faster in fact because it can extract the text layer (I did the wretched of the earth in 4 minutes flat with it). The problem with PDFs is if you copy the text outright it will look weird because of how PDF handles text. MinerU is also content-aware, meaning it will remove the headers and footers if there are any, which is why I recommend it. It should also normally preserve tables (very important in some books) and styling such as italics and bold, which a simple copy-paste doesn't. Basically if you copy PDF raw it looks like this:
As you can see the lines break weirdly, there's a random page number and chapter reminder in the middle, and it's missing some bolded text (book is Jeff Hawkin's On Intelligence).
LLMs can actually work very well with raw PDF text and clean it up for you, but if the text is really chopped up it might need a cleaner copy to start with. But maybe if you want to skip installing a bunch of stuff for minerU to work this could be attempted. Or like I said, if your party is open to the idea, ask them to send you the raw docx files which I'm sure they have (they probably import them into InDesign, and if they don't, they should), and you can just upload that to deepseek and it will take care of the formatting for you.
Otherwise I'm putting the rest down here in a subsection:
spoiler long comment
Getting minerU to work
If you're on windows (which I assume bc you say you are not tech-savvy) you will need to install python 3.12 here https://www.python.org/downloads/release/python-31312/ (scroll down at the bottom for windows installers). During installation make sure to have admin perms and check the "put python in PATH" checkbox or similar (it will say something about PATH).
Once python is installed you can install minerU by opening the cmd, and type
pip install mineru[all](or maybe python pip install mineru[all]). It will take some time but it will install minerU as needed on your computerOnce minerU is installed, in the same cmd window, run
mineru-models-download. Once again it will take some time as it installs a bunch of models. Expect it to take around 11 gigabytes of memory on your disk in total.Once everything is installed, you can simply run an OCR command through this command:
mineru -p /path/to/your/document.pdf -o /path/to/output/folder -l [language]again from the CMD window. But you can do that at any time, you don't need to reinstall everything we just did each time.Language is specified as [ch|ch_server|ch_lite|en|korean|japan|chinese_cht|ta|te|ka|th|el|latin|arabic|east_slavic|cyrillic|devanagari] and means alphabet/writing system, so I think "latin" for hungarian should work. Otherwise it defaults to chinese
If at any point during the installation or trying to use minerU something isn't clear or you get an error output in the CMD, just send the entire output to deepseek and it will tell you what to do. Use the expert mode with search on. I myself installed minerU through copying the commands Deepseek gave me, didn't even need to hunt down anything. I ran into a bug then when trying to run it, sent the output to deepseek, and it found the fix in 2 seconds (installing python development version). I can't stress how stress-free installing technical software has become.
But after that you can quickly and easily OCR any PDF or image on your computer, don't forget to specify a specific folder for the output as minerU creates a bunch of files, including a markdown file and a JSON. That's what its OCR output looks like. With pandoc, which is yet another piece of software to install, you can then transform that .md (markdown file) into another without a hitch. To install pandoc on Windows, download it here https://pandoc.org/installing.html (click 'get the latest installer' then look for pandoc-3.9.0.2-windows-x86_64.msi) and then you can use the command
pandoc text.md -t -o conversion.[extension]Pandoc is a really thorough program that can convert any text from one format to another, such as html to wikitext to markdown to epub to pdf to XML to whatever. You can find demos here that showcase some conversions: https://pandoc.org/demos.html. XML is what Word and LibreOffice use behind the .docx extension, so if you convert to XML you can probably easily open it in Word afterwards.
If I'm not mistaken if you put the [extension to your output file as .docx for example pandoc should automatically know that it has to convert to docx xml.
So basically minerU OCR's the PDF into usable text, but in the markdown format. Then with pandoc, you can clone that markdown into a bunch of different other formats, if that makes sense. Keep the markdown format MinerU makes (which is the same styling language we use on Lemmygrad btw), and reconvert it with pandoc into anything you need.
I did all of this myself the other day for the Wretched of the Earth and it worked really well! Just needs some manual cleaning up afterwards, but that's usually just on the chapter titles and because of the PDF files themselves.
MinerU can also run on your CPU (loaded in RAM) if your GPU can't handle it, you can find the different options by just typing 'mineru --help', and it will tell you how to pick GPU or CPU. I think you need an Nvidia GPU, otherwise use CPU and it should work well too (just takes longer).
Fine-tuning your translation
I've been attempting translation work with LLMs for the past few years and I still haven't found something I'm 100% happy with, though that 2-pass thing (first pass is the translation, second pass is a new conversation where you ask it to proofread and localize like an editor would) yields better results. This is kind of similar to the 'critique' they do in training, where you have the model being trained generating an output, and then another model 'critiques' it to find problems, and the model in training has to improve to fool the critique. I would try things around this concept, like sending a model both the original text and the LLM translation and asking it to compare, proofread, and fix.
LLMs are not great with all languages because they don't necessarily train on those languages. So it'll really depend on the model, you should try a few with the same prompt and input text (just one page of a book is fine, preferably one that is representative of the difficulty of the task). Then once you find a model that seems to handle hungarian fine, refine your prompt when you send it the text. It might be a very long prompt. You might have to include a glossary of technical terms that need to be translated the same way each time, and you might need to specify a bunch of other stuff like what sort of language register to use etc.
And basically you refine your prompt bit by bit like this until you get something that seems "good enough" for you. I find that it's important to tell them to "write naturally without changing the content or the ideas - you are an editor, not an author" or something like that.
Once you're happy with your prompt though you can save it somewhere on your computer to always have it around, and just reuse it each time.
As for language pairs yes it could probably work both ways. I.e. if you can get hungarian translated to english in good quality (by an LLM), you can probably get the LLM to also translate english to hungarian. older methods and humans are more finicky lol, but in my opinion LLMs should have no problem with language pairs as long as they know one of the two languages sufficiently.
Agentic pipeline
The pipeline python scripts I was talking about is, you guessed it, more LLM stuff. Join us over on !crushagent@lemmygrad.ml to learn how to start using agentic on your computer. But basically put 5$ in the deepseek API, install crush, and then have the agent code you a bundle of scripts to automate most of the process for you. It's what I did to get ProleWiki translated to French, it's a collection of 4 different python scripts, all LLM-coded, to 1. download our pages, 2. translate them intelligently with an LLM (with progress tracking, cutting up big files into chunks etc), 3. clean up the translation artefacts due to the model and 4. upload the translated pages to ProleWiki.
You don't need to know computers or how to code anymore to have this kind of stuff and I think that's pretty cool. It definitely helps but for something simple like that you don't need to be too technical. The code might get complex, but you let the AI handle it. You're the client for the script, you don't need to know how it works, just that it does.
It's more involved but then you could have a mostly automated pipeline that runs minerU on the pdfs, sends them to an LLM API to get them translated (like I have), then labels and saves the translations or something. That way instead of doing every step yourself you just run the script.
But if you have PDFs you could probably just feed them manually to Deepseek tbh, by just uploading them in the chatbox.