The Shinkansen is FASCINATING.
I recently went and was amazed by Tokyo's infrastructure and how they have a city under a city.
The fact that there is a bullet train at Tokyo Station every 10 mins or so is mind-blowing.
I went down a YouTube rabbit hole the other night...
Well, of course I'm biased on the answer :). But to give a not-so-biased answer, I would first try to understand what the project is about and whether RAG is a priority in it.
If the project is leveraging agents and LLMs without worrying too much about context/up-to-date data, then Haystack could be a good option.
If the focus is to eventually use RAG then our framework could help.
Additionally, there might be a route where both are used, depending on the use case.
Feel free to DM me if you want to chat further on this!
Relevance calculations are handled by the vector DB, but we try to improve that relevance with metadata: you'll see how our components have "selectors" so that metadata can flow all the way to the vector database at the vector level and influence the results/scores retrieved at search time.
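To make the idea concrete, here is a minimal sketch of how metadata attached to vectors can influence scores at search time. This is purely illustrative (the `search` function, the boost scheme, and the sample docs are made up for the example, not the actual component API); real vector DBs expose this as metadata filters or weighted hybrid scoring.

```python
import numpy as np

def cosine(a, b):
    """Plain cosine similarity between two vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Each record stores the embedding *and* the metadata that was "selected"
# at ingestion time, so it is available to the scorer at query time.
docs = [
    {"text": "doc A", "vec": np.array([1.0, 0.0]), "meta": {"source": "wiki"}},
    {"text": "doc B", "vec": np.array([0.9, 0.1]), "meta": {"source": "tweet"}},
]

def search(query_vec, docs, preferred_source=None, boost=0.1):
    """Score docs by cosine similarity, nudged by a metadata match."""
    results = []
    for d in docs:
        score = cosine(query_vec, d["vec"])
        # Metadata influences the final ranking, not just the raw vector match.
        if preferred_source and d["meta"].get("source") == preferred_source:
            score += boost
        results.append((score, d["text"]))
    return sorted(results, reverse=True)
```

With `preferred_source="tweet"`, doc B outranks doc A even though doc A is the closer vector match, which is the kind of influence on retrieved scores described above.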
Got it. I'd encourage you to expose more of that functionality at the level of your application if possible. I think there is a lot of potential in using more than just cosine similarity, especially when there are lots of candidates and you really want to sharpen up the top few recommendations to the best ones. You might find this open-source library I made recently useful for that:
I've had good results from starting with cosine similarity (using FAISS) and then "enriching" the top results from that with more sophisticated measures of similarity from my library to get the final ranking.
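The two-stage pattern above can be sketched roughly like this. For a self-contained example, NumPy stands in for FAISS in the first stage, and a toy token-overlap score stands in for the library's more sophisticated second-stage measure; all names and data here are illustrative.

```python
import numpy as np

def top_k_cosine(query_vec, doc_vecs, k):
    """Stage 1: cheap cosine similarity over all docs (FAISS's role)."""
    q = query_vec / np.linalg.norm(query_vec)
    d = doc_vecs / np.linalg.norm(doc_vecs, axis=1, keepdims=True)
    scores = d @ q  # dot product of normalized vectors == cosine similarity
    return np.argsort(scores)[::-1][:k]

def rerank(query_text, candidate_ids, texts):
    """Stage 2: re-rank only the top candidates with a richer measure.
    Token overlap is a placeholder for the more sophisticated similarity."""
    q_tokens = set(query_text.lower().split())
    def overlap(i):
        return len(q_tokens & set(texts[i].lower().split()))
    return sorted(candidate_ids, key=overlap, reverse=True)

# Toy corpus
texts = ["fast bullet train schedule", "train station map", "cooking recipes"]
doc_vecs = np.array([[1.0, 0.2], [0.8, 0.6], [0.0, 1.0]])

candidates = top_k_cosine(np.array([1.0, 0.1]), doc_vecs, k=2)
final = rerank("bullet train schedule", list(candidates), texts)
```

The point is that the expensive measure only ever sees the top few candidates, so you get sharper final rankings without paying its cost over the whole corpus.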
Hey! Yes! I am the creator. Anything specific you want to know? We published the tech stack in this tweet https://x.com/kevin_neum/status/1712915693874958604?s=20
but essentially the way it works is:
1. Vercel and Next.js for frontend code and deployment
2. Neum to power the RAG pipelines, so the chatbot can query up-to-date information which we pull from a variety of sources
2.a) the text embeddings are stored in Weaviate (vector db)
3. We then create a prompt/some code with LangChain to query OpenAI and stream the response back!
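The retrieval-and-answer steps (2-3) above look roughly like this in pseudocode form. This is a hypothetical sketch, not the actual pipeline: `retrieve` stands in for the Weaviate vector search that Neum keeps refreshed, and `call_llm` stands in for the LangChain/OpenAI streaming call.

```python
def retrieve(question):
    # Stub for the Weaviate vector search over Neum-ingested embeddings.
    # A real implementation would embed the question and run a similarity query.
    return [{"text": "Candidate X announced policy Y.", "source": "tweet"}]

def build_prompt(question, docs):
    # Ground the model in retrieved context, carrying source labels along
    # so the bot can show users where each piece of info came from.
    context = "\n".join(f"- {d['text']} (source: {d['source']})" for d in docs)
    return f"Answer using only this context:\n{context}\n\nQuestion: {question}"

def call_llm(prompt):
    # Stub for the streamed OpenAI completion via LangChain.
    return "Candidate X announced policy Y. [source: tweet]"

def answer(question):
    docs = retrieve(question)
    return call_llm(build_prompt(question, docs))
```

The key design choice is that sources ride along with the retrieved text all the way into the prompt, which is what lets the bot cite them back to the user.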
Thanks! Yes, we ingest from a variety of sources. You can check the about section of the page, but essentially we power the RAG for the chatbot with Neum (https://neum.ai) - disclaimer, I'm the co-founder.
The bulk of the data is refreshed from tweets from all of the candidates.
We also pull in data from public sources such as Wikipedia and Ballotpedia (the bot outputs the sources used).
And we also pull in transcripts of interviews the candidates have had. Again, if a piece of info was used from any of these sources, we show it to the user.