Today's Deep-Dive: SurfSense
Ep. 248

Today's Deep-Dive: SurfSense

Episode description

This episode discusses SurfSense, an open-source AI research agent designed to unify fragmented information from various sources into a personalized knowledge base. It integrates with numerous external sources like Slack, Notion, and GitHub, as well as supports over 50 file formats, making it a comprehensive tool for knowledge management. SurfSense allows users to search and interact with their data through natural language queries, providing cited answers for transparency. It emphasizes privacy by supporting local large language models (LLMs), ensuring data stays on the user’s machine. The tool also features advanced retrieval-augmented generation (RAG) techniques for accurate and relevant answers. Additionally, SurfSense offers podcast generation and a browser extension for saving protected web content. The project is actively developed, with community involvement encouraged through Discord and a public roadmap on GitHub. The document concludes by highlighting SurfSense’s potential to transform how users learn and manage information.

Gain digital sovereignty now and save costs

Let’s have a look at your digital challenges together. What tools are you currently using? Are your processes optimal? How is the state of backups and security updates?

Digital Souvereignty is easily achived with Open Source software (which usually cost way less, too). Our division Safeserver offers hosting, operation and maintenance for countless Free and Open Source tools.

Try it now for 1 Euro - 30 days free!

Download transcript (.srt)
0:00

Welcome to the deep dive. This is where we really try to cut through the noise and

0:04

get to those

0:04

genuine aha moments you can actually use. And before we jump into today's topic, a

0:10

big shout

0:10

out to our supporter, Safe Server. They're all about hosting innovative software

0:14

like the kind

0:15

we're talking about today, and they're real champions of digital transformation. If

0:19

you

0:19

need solid hosting solutions, you should definitely head over to www.safeserver.de.

0:26

Okay, so let's get real for a second. We're living in this age of just endless

0:30

information,

0:31

right? How many of you feel like you're constantly juggling articles, research

0:35

papers, your own notes,

0:36

chats, all scattered across, I don't know, like 10 different apps? Oh, absolutely.

0:40

It's a huge

0:40

challenge trying to make sense of it all when everything feels so fragmented.

0:44

Exactly. It

0:45

feels like our knowledge gets broken up. Precisely. And well, that's the exact

0:48

problem SurfSense is

0:49

designed to tackle. Think of it as this really innovative open source alternative

0:54

to tools you

0:54

might know, maybe like Notebook LM or Perplexity. But it adds this crucial personal

1:00

layer. It's

1:01

basically your own personal AI assistant for research and knowledge management. It's

1:06

really

1:06

built to bring clarity back to your specific information world. Right. So for this

1:11

deep dive,

1:12

our mission is really to look at SurfSense from, let's say, a beginner's angle,

1:16

make it super

1:17

accessible. We want to unpack what it is, how it works, and maybe why it could be a

1:23

real game

1:24

changer for anyone trying to learn quickly and deeply, but without getting totally

1:27

overwhelmed.

1:28

We've been digging through the project's GitHub, looking at the official docs, and

1:32

honestly,

1:32

what we found looks pretty transformative. It really does. Okay. So let's unpack

1:36

that a bit more.

1:37

What's the fundamental problem SurfSense is really trying to solve here? And what's

1:42

its core promise?

1:43

Yeah. So at its heart, it's tackling that information fragmentation we just talked

1:47

about.

1:48

Having valuable knowledge stuck in different silos, web articles here, your

1:52

documents there,

1:53

chat history somewhere else, personal notes. It's everywhere. Scattered. Exactly.

1:58

SurfSense's core

1:59

promise is basically to unify all of that. The documentation calls it a highly

2:03

customizable AI

2:04

research agent. The big idea, the real insight, is having your own private

2:10

customizable version

2:12

of something like Notebook LM or Perplexity. Okay. Private and customizable. Yes.

2:17

And the

2:17

key difference isn't just that it's open source, though that's important. It's that

2:21

it deeply

2:22

integrates with your personal knowledge base and connects to external sources, too.

2:26

It's really

2:26

about building your own custom knowledge brain tailored to you. That personal

2:30

integration part

2:32

combined with being open source, that really does sound different. I mean, lots of

2:35

us use tools for

2:36

general web research, right? Yeah. But the idea that SurfSense can pull together

2:40

not just the

2:40

public web, but my specific stuff, my drafts, my work chats, my saved links, that

2:45

feels, well,

2:46

significant. It is. How does that personal touch really change the experience

2:49

compared to other

2:50

tools? Well, it all comes down to context and relevance. Yeah. Massively. Instead

2:55

of just doing

2:56

a generic web search, SurfSense connects your specific personal information, maybe

3:00

project docs,

3:01

team chats on Slack, articles you saved. It connects that with external knowledge.

3:06

Ah, okay,

3:07

so it layers them. Exactly. It creates a truly personalized understanding. I mean,

3:12

imagine having

3:13

a super smart assistant that doesn't just know the internet, but actually knows

3:16

your documents,

3:17

your conversations, your saved stuff. It's like having an AI that's read and

3:23

understood everything

3:24

you personally care about. It can connect dots you might not even see yourself.

3:27

That makes sense. So

3:28

it's not just about finding facts anymore, it's about finding insights that are

3:32

directly relevant

3:33

to your world. Okay, this is where it gets really interesting for me then. How on

3:37

earth does it

3:38

manage to pull all this incredibly diverse stuff together? We're talking, like you

3:41

said, old PDFs,

3:43

maybe live Slack messages. That sounds like a massive technical challenge. It is

3:48

definitely,

3:49

and that's where its connectivity power really comes into play. SurfSense connects

3:55

to a really

3:55

extensive list of external sources, stuff you probably use daily. Like what? Okay,

4:01

so for

4:01

searching dynamically, it uses specialized engines like Tavoli and Linkup. Then for

4:07

all your

4:07

collaboration and project management stuff, it links up with Slack, Linear, Jira,

4:12

ClickUp,

4:12

even Confluence. It basically pulls in the heartbeat of your team's work. Wow. Okay.

4:17

The

4:17

big ones. Yeah. And for knowledge and productivity tools, think Notion, Gmail,

4:22

Google Calendar,

4:22

Airtable. It even taps into media and development sources like YouTube videos,

4:27

GitHub repositories,

4:28

and Discord servers. That's huge. And the docs explicitly say more to come so it's

4:32

clearly

4:33

growing. The main idea is to break down those info silos wherever they pop up in

4:37

your digital life.

4:38

That breadth is incredible. It feels like it's designed to reach into pretty much

4:41

every digital

4:42

corner. But what about the stuff not live online? All those files sitting on our

4:46

hard drives,

4:47

that sort of dark data trapped in weird formats? Yeah, great point. That's critical.

4:52

Sursense has

4:53

what they call multiple file format uploading support. It lets you basically liberate

4:57

content

4:58

from your own personal files and feed it right into your knowledge base. And how

5:01

many formats

5:02

are we talking about? Yeah, this supports over 50 plus file extensions. 50, okay.

5:06

Yeah. So think

5:07

common document types. .pdf, .docx, .txt, .txt, .md, .html, all those. Presentations

5:13

too, .ppt, .pttx,

5:15

spreadsheets and data, .xlsx, .csv, even images like .gpg, .png, .tiff, and crucially

5:24

audio and

5:24

video files .mp3, .mp4, .wav. Those are always supported, so no knowledge gets left

5:30

behind.

5:30

How does it manage that range? Well, it smartly uses different specialized ETL

5:35

services. That

5:35

stands for Extract, Transform, Load, things like Lama Cloud, Unstructured, Dockling.

5:40

These aren't

5:40

just grabbing data. They're intelligently pulling out the meaning, even from

5:43

complex messy documents,

5:44

making sure even obscure files add value. So basically, my scattered old word docs,

5:49

my team

5:49

Slack chatter, web pages I saved ages ago, maybe even YouTube videos I watched, all

5:54

unified,

5:54

all searchable in one place. That's the goal, yeah. The real insight here seems to

5:58

be that

5:59

you can finally interrogate your entire digital history, connecting things you'd

6:03

never realize

6:04

were related, making forgotten info actionable again. Exactly. It's a fundamental

6:08

shift. Okay,

6:09

so now I've got everything flown into SurfSense, all my stuff. What can I actually

6:13

do with it?

6:14

How do I get from this massive unified archive to actual insights, not just a

6:20

bigger pile of data?

6:21

Right. Good question. This is where interacting with it gets really powerful. First

6:26

off,

6:26

you get powerful search. You can quickly find anything within all your saved

6:31

content,

6:32

no matter where it came from originally. And it's not just keywords, it understands

6:36

the meaning

6:37

behind your search. Okay, better than just Corotel plus F everywhere. Much better.

6:42

And can I talk to

6:43

it? Like have a conversation with my own knowledge? Absolutely. That's a core

6:46

feature. You can chat

6:47

with your saved content. You use natural language, just like talking to Siri or

6:52

Alexa, but for your

6:53

own stuff. Okay. And critically, this is super important. You get sighted answers.

6:57

Sighted.

6:58

Meaning, meaning it shows you exactly where the information came from in your

7:03

documents or sources.

7:04

The docs say just like perplexity. Ah, that's huge. It really is. It gives you

7:11

transparency

7:11

and builds trust. You can easily check the source, making sure the insights are

7:15

credible

7:15

and traceable. Okay, that is a huge win. So it's not just getting an answer. It's

7:19

getting an answer

7:20

you can actually verify instantly. Precisely. No more endless digging to double

7:24

check a fact. You

7:25

just ask. Get a concise, cited answer and get to those aha moments faster but with

7:31

confidence.

7:32

That's the idea. Speed plus trust. Now, this sounds incredibly powerful. Handling

7:37

potentially

7:38

very sensitive personal data. That immediately brings up questions about, you know,

7:42

privacy and

7:43

data control. What's under the hood making this work? Especially if I'm concerned

7:47

about keeping

7:48

my information private. That's a perfectly valid concern and privacy is really a

7:52

cornerstone of how

7:53

SurfSense is designed. It tackles this directly with privacy and local LLM support.

7:58

Local LLMs.

7:59

Explain that a bit. Sure. LLMs are large language models. The AI brain's doing the

8:05

understanding

8:05

and generation. Local means you can run them directly on your own computer using

8:11

tools like

8:11

a llama. So SurfSense works flawlessly with these. It means your data and the

8:16

processing can stay

8:17

entirely on your machine. Nothing needs to go to the cloud unless you want it to.

8:21

Okay. That gives

8:22

you real control. Complete control. Plus, it's self-hossible because it's open

8:25

source. You can

8:26

easily set it up and run it yourself locally. You own the whole environment. That's

8:29

a big deal for

8:30

privacy. Absolutely. It's a core design choice, not just a feature add-on. It's

8:34

about giving you

8:34

peace of mind. So what's really fascinating then is how to make sure those cited

8:40

answers are actually

8:41

good and relevant, especially when it's sifting through my specific messy data. We

8:47

hear RAG,

8:47

retrieval, augmented generation thrown around a lot. How does SurfSense's version

8:52

make it

8:52

better than basic search? Right. RAG is the core technique, but the implementation

8:56

matters hugely.

8:57

In simple terms, RAG finds relevant info first, then uses the AI model, the LLM, to

9:03

generate a

9:04

nice answer based on that info. But SurfSense uses advanced RAG techniques. It's

9:09

designed for

9:09

flexibility and precision. It supports, get this, over 100 plus LLMs and 6,000 plus

9:16

embedding models.

9:17

Whoa, that's a lot of options. It is. It means you can really tailor it, and it

9:22

integrates all

9:23

major re-rankers like PineCode, Cohere, FlashRank. Re-rankers? What do they do?

9:28

Good question. So the first retrieval step might pull up, say, 20 possible

9:33

documents. A re-ranker

9:34

then looks closely at those 20 and intelligently reorders them, pushing the most

9:39

relevant ones

9:39

right to the top. This is crucial when dealing with diverse personal data, ensuring

9:44

the answer

9:44

is built on the absolute best context. Got it. Smarter sorting. Exactly. And on top

9:49

of that,

9:49

it uses hybrid search. This combines semantic search, understanding the meaning

9:53

with traditional

9:54

keyword search. Then it fuses these results using something called reciprocal rank

9:58

fusion.

9:59

That's a fancy way of saying it intelligently blends the best of both search types.

10:02

Okay,

10:03

so it's layered intelligence. Precisely. The takeaway for you, the user, is that

10:07

all these

10:08

advanced techniques lead to much more accurate, relevant, and comprehensive answers.

10:12

It's not just

10:13

finding data, it's synthesizing it intelligently within your context. So you get

10:17

cutting-edge AI

10:19

research smarts, but with privacy control and that extra layer of accuracy, like a

10:23

personal research

10:24

lab on your desktop. That's a pretty good way to put it. But what about practicalities?

10:28

Running

10:28

local LLMs, processing all that data, does it need a supercomputer? What are the

10:33

hardware needs?

10:34

That's a fair question. Running local LLMs effectively does benefit from decent

10:39

hardware.

10:41

A good GPU, enough RAM definitely helps performance. But SurfSense is designed to

10:45

be as efficient as

10:47

possible. And the beauty of open source is the community is always working on

10:50

optimization.

10:51

There are guidelines for different setups, so you can kind of tailor your

10:54

deployment to the resources

10:55

you actually have. It aims to give you control, even over resource use. Okay, good

11:00

to know. Now,

11:00

I heard whispers it can do even more than just research. Something about podcasts.

11:05

That sounds

11:05

like a totally different direction. It is. And it's genuinely a pretty cool

11:09

extension of the core idea.

11:11

SurfSense has these podcasts features built in. Podcasting features in a research

11:16

tool. Yeah. We're

11:17

talking about a blazingly fast podcast generation agent. The claim is it can create

11:22

a three-minute

11:23

podcast in under 20 seconds. 20 seconds? Seriously? That's what the docs say. It's

11:29

a huge leap for

11:30

quickly creating or sharing info. And even cooler maybe is that it can convert your

11:35

chat conversations

11:36

into engaging audio content. Whoa. So I could turn a Slack discussion into an audio

11:42

summary. Pretty

11:43

much. Imagine summarizing a complex finding into an audio brief for your commute or

11:47

turning meeting

11:48

notes into a podcast for people who missed it. That's incredibly useful. And it

11:51

supports different

11:52

ways to generate the voice, both local text-to-speech options like CocoaOTTS and

11:57

major cloud providers

11:58

like OpenAI, Azure, Google Vertex AI. So you get choices for voice quality. That is

12:04

a massive

12:04

shortcut for content creation or just sharing info accessibly, making knowledge

12:08

more portable,

12:09

more consumable. Exactly. And you mentioned a cross-browser extension earlier. What

12:13

problem

12:13

does that solve? Ah, yes. The browser extension. It's mainly designed to make

12:18

saving web content

12:19

even easier and more complete. Its big use case is to save any web page you like,

12:25

especially ones

12:26

that are protected beyond authentication. Yeming. Meaning sites you have to log

12:30

into like maybe a

12:31

subscription news site, your company's internal wiki, a research database you pay

12:35

for. Okay.

12:36

Stuff that's normally hard to just save. Right. The extension can often capture

12:40

that content while

12:41

you're logged in and pull it seamlessly into your SurfSense knowledge base. It

12:45

helps bypass those

12:46

annoying access hurdles. So it makes your knowledge base truly comprehensive, even

12:51

pulling from

12:52

paywalled or private sources you have access to. That's the idea. Break down

12:56

barriers, make

12:57

information accessible and actionable for you. Generate audio summaries, save

13:02

tricky web pages.

13:03

It's about giving you command over what you learn and how you use it. This all

13:07

sounds amazing,

13:08

but maybe a bit daunting for someone just starting out. How accessible is it? What's

13:12

the journey

13:13

like to actually get started with SurfSense? And where's the project headed? Yeah,

13:17

accessibility

13:18

has been a big focus. There are a couple of main installation options. The docs

13:23

highlight the

13:23

Docker installation is probably the easiest way. All the dependencies, the tricky

13:28

bits, are packaged

13:29

up nicely in containers. It even includes tools like PG admin for database

13:33

management, simplifying

13:34

things. Okay, Docker makes sense for easy setup. But if you like more control or

13:39

want to understand

13:40

the pieces better, there's also a manual installation route. And importantly, both

13:45

ways come with

13:46

detailed step-by-step guides for Windows, Mac OS, and Linux. So pretty much

13:51

whatever system you're on,

13:52

there should be a path for you. That's great to hear. Accessible for beginners

13:56

across platforms.

13:58

And you said it's still actively developing. That community part of open source is

14:01

often where the

14:02

magic happens. Oh, absolutely. SurfSense is actively being developed. It's not

14:06

static.

14:07

It's constantly improving. And this is where the community really comes in. The

14:11

project encourages

14:12

you to join the SurfSense Discord and help shape the future of SurfSense. So you

14:16

can actually get

14:17

involved. Definitely. It's an open invitation to contribute ideas, suggest features,

14:22

report bugs,

14:22

really influence where it goes next. Plus, for transparency, there's a public

14:27

roadmap right on

14:28

GitHub projects. So anyone can see what's being worked on and what's planned for

14:32

the future.

14:33

Fantastic. So it's not just a tool you download and use. It's more like an evolving

14:37

ecosystem

14:37

you can be part of. Exactly. You get the cutting edge features and you can join a

14:41

community pushing

14:42

the boundaries of personal knowledge management. Right. Well, there you have it. A

14:46

deep dive into

14:46

SurfSense. We've looked at how it aims to be your customizable, private, super

14:52

connected AI research

14:54

partner. How it works to turn all that scattered information, web pages, chats,

14:59

your own files into

15:00

one coherent conversational knowledge base that you control. Which really brings up

15:04

an interesting

15:05

thought, doesn't it? In this information flooded world, how could a tool like SurfSense

15:10

truly

15:11

transform the way you learn, the way you create, or even just how you stay informed

15:16

by putting all

15:16

your knowledge finally at your fingertips? It's definitely something to think about.

15:21

We really

15:21

encourage you to check out SurfSense and see how it might fit into your own

15:25

workflow. And before

15:26

we wrap up, one more big thank you to her supporter, SafeServer. For secure

15:31

software hosting and a

15:32

fantastic support with your digital transformation journey, do make sure to visit

15:36

Thanks so much for joining us on this deep dive. We'll catch you next time.

15:36

Thanks so much for joining us on this deep dive. We'll catch you next time.