TWIL 001

Posted on Dec 17, 2023

TWIL ~ This week I learned

The highlight is that the new Keras 3.0 is out.

nDCG Metric

In the RankZephyr paper I came across Normalized Discounted Cumulative Gain (NDCG for short), which seems to be an interesting and widely used metric for ranking systems.

NDCG (normalized discounted cumulative gain) is a measure of the effectiveness of a ranking system that takes into account the position of relevant items in the ranked list. It is based on the idea that items higher in the ranking should be given more credit than items lower in the ranking.

Wiki Page

nDCG visualization
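
To get an intuition for the metric, here is a minimal sketch of how it can be computed. The function names are mine, and I use the simple rel / log2(rank + 1) form of DCG with graded relevance labels:

```python
import numpy as np

def dcg(relevances):
    """Discounted cumulative gain: relevant items higher in the ranking count more."""
    relevances = np.asarray(relevances, dtype=float)
    positions = np.arange(1, len(relevances) + 1)
    return np.sum(relevances / np.log2(positions + 1))

def ndcg(relevances):
    """Normalize DCG by the DCG of the ideal (best possible) ordering."""
    ideal_dcg = dcg(sorted(relevances, reverse=True))
    return dcg(relevances) / ideal_dcg if ideal_dcg > 0 else 0.0

# Relevance labels of the items at ranks 1..4, as returned by the ranker
print(ndcg([3, 1, 0, 2]))  # ~0.94; 1.0 would mean a perfect ordering
```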

Visualizations of Math

Thanks to Hacker News (I lost the link to the original thread :() I ran into multiple interesting ways to visualize and present math.

Gemini

Google released some materials about Gemini and early access to it through GCP (sadly, I was not able to get in). I have not watched the product videos and previews, but the report was an easy read. As we have come to expect, the architecture is not revealed :(. On the other hand, we got 10 pages of contributor names. To be fair, Google did reveal that they use Transformers.

Jeff Bezos on Lex’s podcast

Just wow: https://www.youtube.com/watch?app=desktop&v=DcWqzZ3I2cY&ab_channel=LexFridman. A much more interesting dialogue than the ones with Elon; I just don't know why. It's full of important stuff: teams for papercuts, truth telling, decision making, the order of voicing opinions in meetings, memos vs. PowerPoint slides… There is just so much on how to foster good decisions.

RAG chunking

I started to look into chunking, and this is the output I got from the Edge GPT assistant:

In a typical retrieval-augmented generation (RAG) application, chunking is the process of breaking down the input text into smaller segments or chunks¹. The chunking strategy plays a crucial role in optimizing the relevance and accuracy of the content retrieved from a vector database⁶.
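
As a baseline, the simplest strategy I have seen is fixed-size chunking with a small overlap; a rough sketch (the sizes are arbitrary, not a recommendation):

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into fixed-size character chunks with a small overlap."""
    chunks = []
    start = 0
    while start < len(text):
        end = start + chunk_size
        chunks.append(text[start:end])
        start = end - overlap  # the overlap preserves some context across boundaries
    return chunks
```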

To create chunks based on semantic context using a Large Language Model (LLM), you can use advanced techniques such as content-based chunking⁷. This involves breaking up text based on meaning and sentence structure, for example, using part-of-speech tagging or syntactic parsing⁷. Another approach is sentence chunking, which consists of breaking the text into complete and grammatically correct sentences⁷.
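
A minimal sketch of the sentence-chunking idea, using a naive regex split (a real pipeline would use a proper sentence tokenizer such as spaCy or NLTK):

```python
import re

def sentence_chunks(text: str, max_chars: int = 500) -> list[str]:
    """Group complete sentences into chunks of roughly max_chars characters."""
    # Naive split on ., ! or ? followed by whitespace
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        if current and len(current) + len(sentence) + 1 > max_chars:
            chunks.append(current)
            current = sentence
        else:
            current = f"{current} {sentence}".strip()
    if current:
        chunks.append(current)
    return chunks
```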

As for generating metadata about the chunk, you can assign each chunk a unique identifier and a set of attributes, which describe its content, location, and relationship to other chunks⁷. This metadata can be used to optimize the retrieval process and improve the relevance of the results.
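
A sketch of what such metadata could look like, with chunks stored as plain dicts (the field names are my own, not from any particular library):

```python
import uuid

def with_metadata(chunks: list[str], source: str) -> list[dict]:
    """Attach a unique id plus simple content/location attributes to each chunk."""
    records = []
    for position, text in enumerate(chunks):
        records.append({
            "id": str(uuid.uuid4()),  # unique identifier
            "source": source,         # which document the chunk came from
            "position": position,     # location within the document
            "n_chars": len(text),     # rough content descriptor
            "text": text,
        })
    # record relationships to neighbouring chunks
    for i, record in enumerate(records):
        record["prev_id"] = records[i - 1]["id"] if i > 0 else None
        record["next_id"] = records[i + 1]["id"] if i < len(records) - 1 else None
    return records
```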

Remember, the choice of chunking strategy and the generation of metadata are not trivial tasks. They depend on several factors such as the type and amount of data, the nature and complexity of the queries, and the characteristics and performance of the model⁷. Therefore, it’s important to experiment with different strategies and evaluate their impact on your application’s performance⁵⁶⁷.

Source: Conversation with Bing, 12/12/2023