Today's Deep-Dive: BookWyrm
Ep. 371

Today's Deep-Dive: BookWyrm

Episode description

What if tracking your reading didn’t mean handing your habits, reviews, and recommendations over to a giant corporate platform? In this episode, we explore BookWyrm, an open-source, federated social reading platform that offers a community-driven alternative to services like Goodreads.

BookWyrm lets readers track books, log progress, write reviews, set reading goals, and share updates - but with a fundamentally different philosophy. Instead of feeding algorithms and ad systems, it is designed for high-trust communities, where recommendations come from real people and users control who can see their posts through granular privacy settings.

A major part of BookWyrm’s power comes from federation. Built on the ActivityPub protocol, individual BookWyrm servers can communicate with one another while remaining independently owned and operated. That means a small private book club can still interact with readers on other servers - and even connect with people on platforms like Mastodon - without relying on one centralized corporation to control the network.

We also dive into the technology behind it: a backend built with Python, Django, and PostgreSQL, asynchronous communication handled by Celery and Redis, and a deliberately lightweight front end using Django templates, Bulma, and minimal JavaScript. The result is a platform that prioritizes accessibility, resilience, and user control over trend-driven complexity.

Most importantly, BookWyrm points toward a different future for online culture: one where communities own their infrastructure, preserve their conversations collectively, and build a shared literary record that isn’t dependent on the survival or policies of a single company.

If you’ve ever wanted a social reading platform that feels more like a trusted local book community than a data-mining algorithm, this deep dive into BookWyrm shows what that future could look like.

Gain digital sovereignty now and save costs

Let’s have a look at your digital challenges together. What tools are you currently using? Are your processes optimal? How is the state of backups and security updates?

Digital Souvereignty is easily achived with Open Source software (which usually cost way less, too). Our division Safeserver offers hosting, operation and maintenance for countless Free and Open Source tools.

Try it now!

Download transcript (.srt)
0:00

You know, before we even get into the actual sources today,

0:03

we really need to acknowledge the supporter

0:05

making this deep dive possible.

0:07

Oh, definitely, safe server.

0:08

Right, safe server.

0:10

Because, well, today we're talking about escaping

0:12

these massive corporate data silos.

0:16

Like, think about Goodreads, right?

0:18

Yeah, which is owned by Amazon.

0:19

Exactly, or, you know, posting book reviews on Twitter.

0:22

These are totally closed off ecosystems

0:24

that basically just monetize your reading habits.

0:27

They're machines, yeah, just feeding user data

0:29

into an algorithm.

0:30

And that is exactly where safe server comes in.

0:32

They help organizations find and implement

0:36

open source solutions to replace those exact kinds

0:39

of proprietary monolithic services.

0:42

Which is huge for data sovereignty.

0:43

It really is.

0:44

Like, if you're a business or an association

0:46

and you have strict compliance or data protection

0:48

requirements, you can't just hand your data over to big tech.

0:51

Safe server handles the consulting, the setup,

0:53

and they run it all on secure German servers.

0:55

So you actually own your infrastructure.

0:57

You can check them out at www.safeserver.de.

1:00

And that idea, owning your own space,

1:04

that really brings us to the core of what

1:06

we're looking at today.

1:07

It really does.

1:08

So for today's Deep Dive, our mission

1:11

is to give you, the listener, a beginner-friendly entry

1:14

point into this really revolutionary piece of software

1:18

called Bookworm.

1:19

Yeah, Bookworm.

1:21

With a Y, by the way.

1:22

W-Y-R-M.

1:23

Right.

1:23

And we've got the official Bookworm website

1:25

and their comprehensive GitHub repository documentation

1:30

right here on the table.

1:31

So if you're tired of algorithms dictating your reading list

1:34

and you want to track your books without feeding

1:36

a corporate machine, this is for you.

1:38

OK, let's unpack this.

1:40

Let's do it.

1:40

To start, I just want to ground this

1:42

in the actual user experience.

1:43

Like, what does Bookworm actually

1:45

feel like for a beginner?

1:46

Well, on the surface, it feels very familiar.

1:48

It's a social network for tracking your reading.

1:51

Right.

1:51

You update your status, log when you finish a book,

1:54

set annual goals, that kind of thing.

1:55

Exactly.

1:56

You write reviews.

1:57

But the vibe is entirely different.

1:59

Yeah, reading the docs, the best analogy I could think of

2:01

is, well, imagine walking into one

2:03

of those giant fluorescent-lit mega bookstores,

2:07

where the first thing you see is this massive cardboard display

2:11

for some celebrity memoir, right?

2:13

Bought and paid for by a huge publisher.

2:15

The whole space is a sales funnel.

2:17

Totally.

2:17

You are just a data point to them.

2:20

But Bookworm feels like leaving that fluorescent mega mall

2:24

and stepping into a cozy, dusty, local bookstore.

2:28

I love that.

2:29

Where under certain books, there are actual handwritten index

2:33

cards from the staff.

2:35

If this made you cry, try this one.

2:38

Real human connections.

2:39

And what's fascinating here is that the developers

2:41

are super explicit about that philosophical shift

2:44

in their documentation.

2:45

Oh, really?

2:46

How so?

2:46

They explicitly state that Bookworm

2:49

isn't primarily meant to just be a sterile catalog

2:52

or a universal data source.

2:53

Right, it's not the Library of Congress.

2:55

Exactly.

2:55

I mean, it does function as a database, obviously.

2:58

But it's built specifically for high trust communities.

3:01

They prioritize recommendations from actual people,

3:03

not algorithms.

3:04

Which is such a relief.

3:05

There's no machine learning model in the background

3:08

trying to maximize your screen time by feeding

3:11

you controversial reviews.

3:12

No, your feed is chronological.

3:14

It's just the friends and communities

3:16

you actively choose to follow.

3:17

And there was this phrase in the docs

3:19

that really stuck out to me.

3:21

They encourage casual commentary as you go.

3:24

Yes.

3:26

Because on massive platforms, writing a review

3:30

feels like you're shouting into a stadium.

3:33

You feel like you have to write a Pulitzer winning essay.

3:36

Exactly.

3:36

But here, it's just like leaning over to a friend on the couch

3:39

and saying, I'm on chapter four and this main character

3:43

is driving me insane.

3:44

And you can only really have that casual tone because

3:48

of how they handle privacy.

3:49

Right, the granular privacy settings.

3:51

Yeah, you're not forced to broadcast everything

3:54

to the entire internet.

3:55

You can restrict a reading update

3:57

to just your approved followers or even keep it completely

4:00

private just for yourself.

4:01

Which makes sense.

4:03

Maybe you're reading a dense technical manual for work

4:05

and you want to share notes with colleagues.

4:08

But then on the weekend, you're reading some trashy romance

4:11

novel and you only want your close friends

4:13

to see your commentary.

4:14

You control the dial.

4:16

But OK, let me push back a little here.

4:17

Go for it.

4:18

Because if the whole point is to create

4:20

these small cozy independent spaces,

4:24

how do you not just end up totally isolated?

4:27

The disconnected island problem.

4:28

Yeah.

4:29

Like if I lock myself in a tiny private neighborhood bookshop,

4:32

I can't talk to anyone who doesn't live on my street.

4:35

How does the broader community actually benefit from that?

4:39

If we connect this to the bigger picture,

4:41

this is actually where the magic of federation comes in.

4:44

Bookworm relies on a protocol called Activity Pub.

4:47

Activity Pub.

4:48

Right.

4:49

So Bookworm isn't one giant server sitting in an Amazon

4:53

data center.

4:53

It's software that anyone can install on their own server.

4:56

That's called an instance.

4:58

Got it.

4:58

But because all these separate instances use Activity Pub,

5:01

they speak the same language, they can talk to each other.

5:03

So my private book club instance can still

5:05

talk to the rest of the world.

5:06

Exactly.

5:07

The source material actually gives a really funny example.

5:09

You can run a closed instance just

5:11

for your real world book club.

5:13

OK.

5:13

But you can still follow your friend

5:15

posts on some highly niche server devoted entirely

5:19

to like 20th century Russian speculative fiction.

5:22

That is incredibly specific.

5:24

Right.

5:24

But you can interact singlessly.

5:26

And because it's an open standard,

5:27

it actually goes beyond just Bookworm.

5:30

Wait, what do you mean?

5:31

So Mastodon and Pluroma, which are micro blogging platforms,

5:34

kind of like Twitter Alternative,

5:36

they also use Activity Pub.

5:38

Oh, wow.

5:39

So a friend on Mastodon can actually

5:41

read and comment directly on your Bookworm review,

5:44

right from their Mastodon feed.

5:45

That's crazy.

5:46

So you're not locked into one specific app to talk to people.

5:49

Exactly.

5:50

It's the run your own social philosophy.

5:52

You control your small, high trust network,

5:54

but you're not cut off from the global conversation.

5:57

But wait, how does that work for finding books?

6:00

Like, if everyone is on these tiny scattered servers,

6:03

isn't the database just fragmented?

6:05

You would think so, right?

6:06

Yeah, like if I search for a new Stephen King book,

6:08

but the data for it is on some other server,

6:11

do I just get zero results?

6:13

So this is brilliant.

6:14

Federation actually solves the island problem

6:16

for the database, too.

6:18

When these instances interact, when users share messages

6:21

or reviews, the servers share metadata in the background.

6:25

Oh, so they're basically gossiping with each other.

6:27

Exactly.

6:27

Hey, my user just reviewed this book.

6:30

Here's the title, author, and cover art.

6:32

And then your server saves that.

6:34

Right, it caches it.

6:35

So collaboratively, organically, all these independent servers

6:40

are building a decentralized global database

6:42

of books.

6:43

Without relying on a central corporate server, that is wild.

6:47

It's a huge technical achievement.

6:50

Which naturally brings us to peeking under the hood.

6:53

Because coordinating that kind of decentralized network

6:56

sounds like a nightmare to build.

6:58

It's definitely not simple.

6:59

We've got the GitHub repository stats here.

7:01

Let's talk about the tech stack making this possible.

7:04

Sure.

7:04

So the back end is heavily Python.

7:06

In fact, Python makes up almost 75% of the code base.

7:09

Wow, 74.7% to be exact, according to the repo.

7:13

Right.

7:13

And they use Django as their web framework

7:16

with PostgresWolf for the main database.

7:18

OK, so pretty standard robust tools so far.

7:21

Yes, but the real heavy lifting for federation

7:23

is handled by Celery and Redis.

7:25

OK, and for anyone listening who isn't a back end developer,

7:28

why are Celery and Redis so crucial here?

7:31

Well, the decentralized web is messy.

7:34

Servers go offline, connections drop.

7:36

If your bookworm server tries to send a review

7:39

to a server in Tokyo, and that server is down.

7:42

Your server would just freeze up waiting for it.

7:44

Exactly, if it was synchronous.

7:45

But Celery is an asynchronous task queue.

7:48

It takes that message, hands it to Redis,

7:51

which is like a temporary holding pen,

7:53

and basically says, keep trying to send this in the background,

7:56

but let the user keep clicking around.

7:58

Oh, that makes so much sense.

7:59

So it handles the friction of the real world.

8:02

Exactly.

8:02

They use Docker and Nginx for deployment, too.

8:05

They're just using these incredibly battle-tested

8:08

open source staples to build something really resilient.

8:11

And here's where it gets really interesting to me,

8:13

the front end, what the user actually sees.

8:15

Oh, yeah.

8:16

The docs mention they use Django templates and a CSS

8:19

framework called Bulma.io.

8:22

But they have this one phrase in the developer documentation.

8:25

Vanilla JavaScript in moderation.

8:27

Yes.

8:28

Vanilla JavaScript in moderation.

8:30

I mean, it's just such a sunny, quiet rebellion

8:33

against modern web development.

8:35

It really is.

8:36

Because right now, every corporate site

8:38

is just a massive, bloated JavaScript application

8:41

that drains your phone's battery just to show you text.

8:44

Right.

8:44

Chasing the latest Silicon Valley framework.

8:46

But they're just like, nope.

8:48

Keep it simple.

8:49

Vanilla JS, and only when we need it.

8:52

It perfectly mirrors their whole anti-corporate friendly ethos.

8:56

This raises an important question, though.

8:58

How do you actually build and maintain something

9:00

this complex without a massive corporate budget?

9:03

Seriously.

9:04

I mean, looking at the GitHub repo,

9:06

they have over 2.6 thousand stars, which is great.

9:09

Yeah.

9:09

But the wild part is they have 283 forks and over 105

9:13

individual contributors.

9:14

A project of that size, managing a custom activity

9:17

pub implementation with 105 strangers,

9:20

that takes serious coordination.

9:23

And they do it all out in the open.

9:25

The repository has a contributing .md file

9:27

that explicitly says you don't even

9:29

need to know how to code to help.

9:31

Right, they need people for documentation, translations,

9:33

design, bug testing.

9:34

That's a real grassroots movement.

9:36

And financially, they're entirely

9:38

supported by the community.

9:39

Patreon and Open Collective, right?

9:41

Yeah, exactly, which means no venture capitalists breathing

9:44

down their necks demanding a return on investment.

9:47

And that's the ultimate protection, isn't it?

9:49

As soon as a startup takes VC money, the clock is ticking.

9:52

Eventually, they have to monetize.

9:54

Yep, they sell your data or fill your feed with ads.

9:57

Bookworm doesn't have that pressure.

9:59

So bringing this all together for you, the listener,

10:03

why does all this matter?

10:05

Right, why should you care about Activity Pub and RedisQs?

10:09

Because taking control of your digital life

10:12

doesn't mean you have to give up the joy of sharing

10:14

a good book with friends.

10:15

Exactly, you don't have to choose between a corporate data

10:18

harvesting machine and tracking your books alone

10:20

in an Excel spreadsheet.

10:21

You can choose an environment built for you,

10:23

not for advertisers.

10:25

And before we wrap up, I want to leave you

10:28

with one final thought to sort of mull over something

10:30

that builds on what we've talked about.

10:32

The sources mention that Bookworm collaboratively

10:35

builds this decentralized database of books.

10:38

The federated library.

10:40

Right, but think about the implications of that.

10:42

If our collective reading habits, reviews,

10:44

and literary discussions are stored across thousands

10:47

of independent servers.

10:48

Rather than one corporate database.

10:50

Exactly, how might this permanent, uncensorable,

10:53

federated library change the way human literary history is

10:57

preserved?

10:57

Oh, wow.

10:59

Because if Goodreads shuts down tomorrow,

11:01

millions of reviews vanish.

11:03

It's a single point of failure.

11:04

Corporate censorship, or just a server shutdown,

11:07

and it's all gone.

11:08

But with Bookworm, if one server goes offline,

11:12

the metadata lives on everywhere else.

11:15

It's truly preserving history against corporate decay.

11:18

That is a profound way to look at it.

11:20

The community literally owns the infrastructure.

11:23

And that concept, taking ownership

11:25

of your infrastructure, brings us right back

11:27

to our sponsor, SafeServer.

11:28

It's the exact same principle just applied

11:31

to an organizational level.

11:32

Exactly.

11:33

Because organizations face this exact same dilemma.

11:36

They're locked into proprietary software

11:39

from massive vendors paying exorbitant licensing fees

11:42

and surrendering their operational data.

11:44

And SafeServer provides the escape hatch.

11:47

They really do.

11:48

By switching to open source solutions,

11:49

whether it's Bookworm for a massive book club

11:53

or specialized enterprise tools for a business,

11:55

you gain total autonomy over your community guidelines

11:59

and your user data.

12:00

And honestly, you save a significant amount of money

12:03

by cutting out those proprietary licenses.

12:05

Huge savings.

12:06

So if you're part of a business, an association,

12:09

or any group looking to regain control,

12:11

SafeServer can be commissioned for consulting.

12:13

They will help you find and implement

12:15

the exact right open source fit for your needs

12:17

and operate it securely on German servers.

12:20

Total data sovereignty.

12:21

Head over to www.SafeServer.de to learn more.

12:25

It's time to build a web that actually works for us.

12:27

I couldn't agree more.

12:28

We'll catch you on the next one.

12:28

We'll catch you on the next one.