Please add your ideas - infrastructure planing thread #38
Replies: 12 comments 12 replies
-
Aren’t GitHub Discussions better suited for this? |
Beta Was this translation helpful? Give feedback.
-
I like the idea of tools. I'm very interested in the programming chain, but the challenge is to run it in a sandboxed environment I suppose? |
Beta Was this translation helpful? Give feedback.
-
Love the project and idea to open source perplexity 👍 |
Beta Was this translation helpful? Give feedback.
-
Is this "retrieval augmented generation" (RAG), and -- with tools -- "LLM agent"? I didn't see these terms mentioned in the document, so I was wondering if I'm missing something. |
Beta Was this translation helpful? Give feedback.
-
I'd love the ability to add local folders (Obsidian notes), Google Drive/Dropbox locations to search. |
Beta Was this translation helpful? Give feedback.
-
I combined this with distrobox to setup an ubuntu linux container for rocmn support with 7900xtx in ollama: Used the ini from this ROCm/ROCm#2990 (comment) It does need some limits, this froze up my machine :D |
Beta Was this translation helpful? Give feedback.
-
what does it uses to search on internet? |
Beta Was this translation helpful? Give feedback.
-
Non web data sources.
Just a lot of places that you can pull data from. Maybe creating a data template / format that can be ingested into the vector db would be ideal, then writing downloadeders that take the data and reformat it for the vector DB would be the easiest way to implement this. Also I would like to see an open source replacement to Wolfram Alpha, specifically the math solving part of it but that seems like a different much harder task unto itself. |
Beta Was this translation helpful? Give feedback.
-
Hey, im going to recreate the entire Front End of Perplexity, ill post it here once done, Should be done by the 26th of April |
Beta Was this translation helpful? Give feedback.
-
I open discussion on partitioning the memory/vectordb:
and thankyou @nilsherzig this is working very very well, even in that early state. |
Beta Was this translation helpful? Give feedback.
-
add an url for embedding models.to overcome the need to load/unload/change model permanently, introduce a embedding-urli.e. we can run a
text embeddings are needed by many AI-APPS, so that it's usefull to have a dedicated endpoint on your Machine. |
Beta Was this translation helpful? Give feedback.
-
For those interested but didn't know that it exists. It is totally free So all beatifulsoaping is obsolete even images will be converted with ALT text. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Please comment if you have any thoughts on this:
We have a "chat layer" which has a history of the user's prompts and messages (green and purple). If needed, this chat chain can call other chains, which run autonomous without user interaction by using a self critique loop.
It would be very easy to add more chains to this, like a "programming chain" using deepseek-coder as the LLM.
I would have to come up with a config format for these chains, but essentially they are just a couple of conditions and strings in / strings out.
Beta Was this translation helpful? Give feedback.
All reactions