这是用户在 2025-6-4 14:11 为 https://www.reddit.com/r/LocalLLaMA/comments/1ky14jn/open_source_alternative_to_notebooklm/ 保存的双语快照页面,由 沉浸式翻译 提供双语支持。了解如何保存?
Skip to main content Open Source Alternative to NotebookLM : r/LocalLLaMA
r/LocalLLaMA icon
Go to LocalLLaMA

Open Source Alternative to NotebookLM
NotebookLM 的开源替代品

Other

For those of you who aren't familiar with SurfSense, it aims to be the open-source alternative to NotebookLMPerplexity, or Glean.
对于不熟悉 SurfSense 的你们,它的目标是成为 NotebookLM、Perplexity 或 Glean 的开源替代品。

In short, it's a Highly Customizable AI Research Agent but connected to your personal external sources search engines (Tavily, LinkUp), Slack, Linear, Notion, YouTube, GitHub, and more coming soon.
简单来说,它是一个高度可定制的 AI 研究代理,但连接到你的个人外部资源,包括搜索引擎(Tavily、LinkUp)、Slack、Linear、Notion、YouTube、GitHub 等,未来还将有更多功能。

I'll keep this short—here are a few highlights of SurfSense:
我会保持简短——以下是 SurfSense 的一些亮点:

📊 Features   📊 功能

  • Supports 150+ LLM's
    支持超过 150 种 LLM

  • Supports local Ollama LLM's or vLLM.
    支持本地 Ollama LLM 或 vLLM。

  • Supports 6000+ Embedding Models
    支持超过 6000 种嵌入模型

  • Works with all major rerankers (Pinecone, Cohere, Flashrank, etc.)
    适用于所有主要重排序器(Pinecone、Cohere、Flashrank 等)

  • Uses Hierarchical Indices (2-tiered RAG setup)
    采用分层索引(2 级 RAG 设置)

  • Combines Semantic + Full-Text Search with Reciprocal Rank Fusion (Hybrid Search)
    结合语义搜索+全文搜索与互惠排序融合(混合搜索)

  • Offers a RAG-as-a-Service API Backend
    提供 RAG 即服务 API 后端

  • Supports 34+ File extensions
    支持 34+文件扩展名

🎙️ Podcasts    🎙️ 播客

  • Blazingly fast podcast generation agent. (Creates a 3-minute podcast in under 20 seconds.)
    极速播客生成代理。(在 20 秒内创建 3 分钟播客。)

  • Convert your chat conversations into engaging audio content
    将您的聊天对话转换为引人入胜的音频内容

  • Support for multiple TTS providers (OpenAI, Azure, Google Vertex AI)
    支持多种 TTS 服务提供商(OpenAI、Azure、Google Vertex AI)

ℹ️ External Sources
ℹ️ 外部来源

  • Search engines (Tavily, LinkUp)
    搜索引擎(Tavily、LinkUp)

  • Slack

  • Linear

  • Notion

  • YouTube videos   YouTube 视频

  • GitHub

  • ...and more on the way
    更多内容即将推出

🔖 Cross-Browser Extension
🔖 跨浏览器扩展

The SurfSense extension lets you save any dynamic webpage you like. Its main use case is capturing pages that are protected behind authentication.
SurfSense 扩展允许你保存任何你喜欢的动态网页。它的主要用途是捕获受身份验证保护页面。

Check out SurfSense on GitHub: https://github.com/MODSetter/SurfSense
在 GitHub 上查看 SurfSense:https://github.com/MODSetter/SurfSense


Sort by:
Best
Open comment sort options

Can you customize the length of the podcasts? I generally enjoy one or two hour sessions, and the idea of a three minute podcast isn't useful to me.
你能自定义播客的长度吗?我通常喜欢一两个小时的单次会话,三分钟的播客对我来说没什么用。

}

Hey it's doable should be done in few weeks 👍
这可行,应该几周内完成👍

}

I'll be excited to see it! I'll keep my eye on the project. Thanks for the response.
我会很期待看到它!我会持续关注这个项目。谢谢你的回复。

}
More replies
More replies
Profile Badge for the Achievement Top 1% Commenter Top 1% Commenter

I tried this out but it asks for a api key from site called Unstructured.io, which after I logined with my google account, the site insists me to fill in data so I can request a sales demo. no access.
我试了一下,但它要求从名为 Unstructured.io 的网站获取 API 密钥,登录我的谷歌账户后,该网站坚持让我填写数据才能申请销售演示。无法访问。

Since this pipeline relies on that to do file-parsing, I eventually gave it up.
因为这个管道依赖于它来解析文件,我最终放弃了。

The repo itself seems legit so wish best luck for the maintainers, just unfortunate one of the dependency changed their usage flow.
这个仓库本身看起来很可靠,祝维护者一切顺利,只是不幸的是其中一个依赖项更改了他们的使用流程。

}

Man sorry about this but for some reason unstructured.io started limiting sign ups a few days back. I am adding support of LlamaParse atm. Should be done in a day or two.
很抱歉,但不知为何 unstructured.io 前几天开始限制注册。我现在正在添加 LlamaParse 的支持,应该在一两天内完成。

}

It would be nice if there were open source/local options.
如果有开源/本地选项那就好了。

}

noted I guess I will add docling support as well.
明白了,我想我也会添加 docling 的支持。

}
More replies
More replies
More replies

Looks neat, keep up the good work.
看起来很棒,继续保持良好的工作。

}

🙏🙏

}
More replies

Does it support multimodal RAG?
它支持多模态 RAG 吗?

}

Not right now but I plan to .... Give me a few good examples of the multimodal RAG system according to you.
目前还不支持,但我计划……请你根据你的看法,给我几个多模态 RAG 系统的好例子。

}

I'm still trying to have a multimodal RAG for myself, for 2 primary use cases:
我还在尝试为自己实现一个多模态 RAG,主要用于两个主要场景:

1 - to analyse online game matches. To input images from the game, alongside with text and then be able to retrieve "smart information" about the game and the match. (specific)
1 - 分析在线游戏比赛。输入游戏中的图像,以及文本,然后能够检索关于游戏和比赛的"智能信息"。(具体)

2 - to be able to analyse charts, drawings and then retrieve information about them from the RAG (general)
2 - 能够分析图表、绘图,然后从 RAG(通用)中检索相关信息

Is the Colpali technic or method.
这是 Colpali 技术或方法。

}

Understood will try to get this done in a month or two :)
Understood 将会尝试在一个月或两个月内完成这件事 :)

}
More replies
More replies
More replies
[deleted]

Comment deleted by user

Edited
Profile Badge for the Achievement Top 1% Commenter Top 1% Commenter

```
🔔 Privacy & Local LLM Support

Works Flawlessly with Ollama local LLMs.
```

Sadly, Ollama currently doesn't work with their Docker installation method, as indicated in official installation documentation. Might take a bit hassle if you want to go local with this

Did you tried http://host.docker.internal:11434

export OLLAMA_HOST=

More replies
[deleted]

Comment deleted by user

Will look into this but not a priority right now.

[deleted]

Comment deleted by user

That em dash and your post history are a huge tell

Bro's prompt is not that good.

More replies

Looks cool   看起来很酷

}

🙏🙏

}
More replies

Stupid question, how have you implemented youtube?
愚蠢的问题,你是怎么实现 YouTube 的?

}

Using this https://pypi.org/project/youtube-transcript-api/
使用这个 https://pypi.org/project/youtube-transcript-api/

}

Ho cool thanks   太酷了,谢谢

}
More replies
More replies

Thanks for sharing SurfSense—it's great to see more privacy-focused AI tools emerging!
感谢分享 SurfSense——很高兴看到更多注重隐私的 AI 工具涌现!

For Mac users looking for a simpler, offline option, Elephas lets you create collections ('Brains') of your own docs, notes, and videos, and then semantically search or chat with them—all without your data leaving your device (unless you opt in to your own cloud provider). Might be worth a look if you're seeking a focused, privacy-first alternative that works out of the box on personal files.
对于寻找更简单、离线选项的 Mac 用户,Elephas 允许你创建包含自己文档、笔记和视频的集合('Brains'),然后对它们进行语义搜索或聊天——所有操作均不将你的数据离开设备(除非你选择接入自己的云服务提供商)。如果你在寻找一个专注隐私、开箱即用的个人文件解决方案,这可能值得一试。

It does support Ollama based models, in fact we have built a interface for Ollama for Mac.
事实上,它支持基于 Ollama 的模型,我们已经为 Mac 构建了一个 Ollama 接口。

}

specs needed?   需要规格吗?

}