Thanks for highlighting the LLMlingua model. Text files will forever be larger than LLM contexts. Strategies like LLMLingua can help compress text files in such a way that information is not lost so that LLMs can summarize without running out of context.
Although the compressed text is still human readable, do you think it may be preferable to save the original text for human readability i.e whether 2 versions of text should be saved, one for humans and another for LLMs.
Usually, if you need compression, you're dealing with data at a scale humans can't easily process anyway. So, once you've tested it and know the compression works for your use case, there's really no benefit to storing a human-readable version alongside it.
Thanks for highlighting the LLMlingua model. Text files will forever be larger than LLM contexts. Strategies like LLMLingua can help compress text files in such a way that information is not lost so that LLMs can summarize without running out of context.
Although the compressed text is still human readable, do you think it may be preferable to save the original text for human readability i.e whether 2 versions of text should be saved, one for humans and another for LLMs.
Usually, if you need compression, you're dealing with data at a scale humans can't easily process anyway. So, once you've tested it and know the compression works for your use case, there's really no benefit to storing a human-readable version alongside it.