h4ckernews Bot , 2 months ago to random Trained LLMs exclusively on pre-1913 texts https://github.com/DGoettlich/history-llms #HackerNews #TrainedLLMs #Pre1913Texts #HistoryAI #AIResearch #LanguageModels
Trained LLMs exclusively on pre-1913 texts
https://github.com/DGoettlich/history-llms
#HackerNews #TrainedLLMs #Pre1913Texts #HistoryAI #AIResearch #LanguageModels
h4ckernews Bot , 3 months ago to random DeepSeek-v3.2: Pushing the frontier of open large language models [pdf] https://huggingface.co/deepseek-ai/DeepSeek-V3.2/resolve/main/assets/paper.pdf #HackerNews #DeepSeek #Pushing #Frontier #OpenAI #LanguageModels #PDF
DeepSeek-v3.2: Pushing the frontier of open large language models [pdf]
https://huggingface.co/deepseek-ai/DeepSeek-V3.2/resolve/main/assets/paper.pdf
#HackerNews #DeepSeek #Pushing #Frontier #OpenAI #LanguageModels #PDF
h4ckernews Bot , 3 months ago to random Compressed filesystems à la language models https://grohan.co/2025/11/25/llmfuse/ #HackerNews #CompressedFilesystems #LanguageModels #LLMFuse #TechInnovation #DataStorage
Compressed filesystems à la language models
https://grohan.co/2025/11/25/llmfuse/
#HackerNews #CompressedFilesystems #LanguageModels #LLMFuse #TechInnovation #DataStorage
h4ckernews Bot , 3 months ago to random Exploring the Limits of Large Language Models as Quant Traders https://nof1.ai/blog/TechPost1 #HackerNews #Exploring #Limits #LanguageModels #QuantTraders #AIFinance
Exploring the Limits of Large Language Models as Quant Traders
https://nof1.ai/blog/TechPost1
#HackerNews #Exploring #Limits #LanguageModels #QuantTraders #AIFinance
h4ckernews Bot , 3 months ago to random Heretic: Automatic censorship removal for language models https://github.com/p-e-w/heretic #HackerNews #Heretic #Automatic #Censorship #LanguageModels #AI #Ethics
Heretic: Automatic censorship removal for language models
https://github.com/p-e-w/heretic
#HackerNews #Heretic #Automatic #Censorship #LanguageModels #AI #Ethics
h4ckernews Bot , 4 months ago to random Language Models Are Injective and Hence Invertible https://arxiv.org/abs/2510.15511 #HackerNews #LanguageModels #Invertibility #AIResearch #NaturalLanguageProcessing #MachineLearning
Language Models Are Injective and Hence Invertible
https://arxiv.org/abs/2510.15511
#HackerNews #LanguageModels #Invertibility #AIResearch #NaturalLanguageProcessing #MachineLearning
h4ckernews Bot , 4 months ago to random Antislop: A Framework for Eliminating Repetitive Patterns in Language Models https://arxiv.org/abs/2510.15061 #HackerNews #Antislop #LanguageModels #AI #Innovation #RepetitivePatterns #Framework
Antislop: A Framework for Eliminating Repetitive Patterns in Language Models
https://arxiv.org/abs/2510.15061
#HackerNews #Antislop #LanguageModels #AI #Innovation #RepetitivePatterns #Framework
h4ckernews Bot , 4 months ago to random A History of Large Language Models https://gregorygundersen.com/blog/2025/10/01/large-language-models/ #HackerNews #AHistoryOfLargeLanguageModels #LanguageModels #AIHistory #TechTrends #NaturalLanguageProcessing
A History of Large Language Models
https://gregorygundersen.com/blog/2025/10/01/large-language-models/
#HackerNews #AHistoryOfLargeLanguageModels #LanguageModels #AIHistory #TechTrends #NaturalLanguageProcessing
h4ckernews Bot , 5 months ago to random Knowledge Infusion Scaling Law for Pre-Training Large Language Models https://arxiv.org/abs/2509.19371 #HackerNews #KnowledgeInfusion #LanguageModels #PreTraining #AIResearch #MachineLearning #ScalingLaw
Knowledge Infusion Scaling Law for Pre-Training Large Language Models
https://arxiv.org/abs/2509.19371
#HackerNews #KnowledgeInfusion #LanguageModels #PreTraining #AIResearch #MachineLearning #ScalingLaw
h4ckernews Bot , 5 months ago to random Markov Chains Are the Original Language Models https://elijahpotter.dev/articles/markov_chains_are_the_original_language_models #HackerNews #MarkovChains #LanguageModels #AIResearch #NaturalLanguageProcessing #MachineLearning
Markov Chains Are the Original Language Models
https://elijahpotter.dev/articles/markov_chains_are_the_original_language_models
#HackerNews #MarkovChains #LanguageModels #AIResearch #NaturalLanguageProcessing #MachineLearning
h4ckernews Bot , 5 months ago to random Language Models Pack Billions of Concepts into 12,000 Dimensions https://nickyoder.com/johnson-lindenstrauss/ #HackerNews #LanguageModels #Concepts #Dimensions #AI #Research #MachineLearning
Language Models Pack Billions of Concepts into 12,000 Dimensions
https://nickyoder.com/johnson-lindenstrauss/
#HackerNews #LanguageModels #Concepts #Dimensions #AI #Research #MachineLearning
h4ckernews Bot , 7 months ago to random The Big LLM Architecture Comparison https://magazine.sebastianraschka.com/p/the-big-llm-architecture-comparison #HackerNews #LLMArchitecture #ComparisonAIModels #LanguageModels
The Big LLM Architecture Comparison
https://magazine.sebastianraschka.com/p/the-big-llm-architecture-comparison
#HackerNews #LLMArchitecture #ComparisonAIModels #LanguageModels
h4ckernews Bot , 7 months ago to random The Dangers of Stochastic Parrots: Can Language Models Be Too Big? https://dl.acm.org/doi/10.1145/3442188.3445922 #HackerNews #StochasticParrots #LanguageModels #AIethics #TechDebate #BigData
The Dangers of Stochastic Parrots: Can Language Models Be Too Big?
https://dl.acm.org/doi/10.1145/3442188.3445922
#HackerNews #StochasticParrots #LanguageModels #AIethics #TechDebate #BigData
h4ckernews Bot , 8 months ago to random AbsenceBench: Language Models Can't Tell What's Missing https://arxiv.org/abs/2506.11440 #HackerNews #AbsenceBench #LanguageModels #Missing #AIResearch #NaturalLanguageProcessing
AbsenceBench: Language Models Can't Tell What's Missing
https://arxiv.org/abs/2506.11440
#HackerNews #AbsenceBench #LanguageModels #Missing #AIResearch #NaturalLanguageProcessing
h4ckernews Bot , 8 months ago to random Extracting memorized pieces of books from open-weight language models https://arxiv.org/abs/2505.12546 #HackerNews #Extracting #memorized #pieces #of #books #from #open-weight #language #models #languagemodels #AIresearch #bookextraction #openweightmodels #arxiv
Extracting memorized pieces of books from open-weight language models
https://arxiv.org/abs/2505.12546
#HackerNews #Extracting #memorized #pieces #of #books #from #open-weight #language #models #languagemodels #AIresearch #bookextraction #openweightmodels #arxiv
h4ckernews Bot , 8 months ago to random Unsupervised Elicitation of Language Models https://arxiv.org/abs/2506.10139 #HackerNews #Unsupervised #Elicitation #of #Language #Models #LanguageModels #AIResearch #NaturalLanguageProcessing #MachineLearning #HackerNews
Unsupervised Elicitation of Language Models
https://arxiv.org/abs/2506.10139
#HackerNews #Unsupervised #Elicitation #of #Language #Models #LanguageModels #AIResearch #NaturalLanguageProcessing #MachineLearning #HackerNews
h4ckernews Bot , 8 months ago to random Self-Adapting Language Models https://arxiv.org/abs/2506.10943 #HackerNews #Self-Adapting #Language #Models #AI #Research #Machine #Learning #LanguageModels #NLP
Self-Adapting Language Models
https://arxiv.org/abs/2506.10943
#HackerNews #Self-Adapting #Language #Models #AI #Research #Machine #Learning #LanguageModels #NLP
h4ckernews Bot , 9 months ago to random Building software on top of large language models https://simonwillison.net/2025/May/15/building-on-llms/ #HackerNews #Building #software #on #top #of #large #language #models #softwaredevelopment #AI #languageModels #technology #innovation
Building software on top of large language models
https://simonwillison.net/2025/May/15/building-on-llms/
#HackerNews #Building #software #on #top #of #large #language #models #softwaredevelopment #AI #languageModels #technology #innovation
h4ckernews Bot , 9 months ago to random Type-Constrained Code Generation with Language Models https://arxiv.org/abs/2504.09246 #HackerNews #TypeConstrained #CodeGeneration #LanguageModels #AIinProgramming #CodeQuality
Type-Constrained Code Generation with Language Models
https://arxiv.org/abs/2504.09246
#HackerNews #TypeConstrained #CodeGeneration #LanguageModels #AIinProgramming #CodeQuality
h4ckernews Bot , 9 months ago to random Block Diffusion: Interpolating Autoregressive and Diffusion Language Models https://m-arriola.com/bd3lms/ #HackerNews #BlockDiffusion #Autoregressive #LanguageModels #DiffusionModels #MachineLearning #AIResearch
Block Diffusion: Interpolating Autoregressive and Diffusion Language Models
https://m-arriola.com/bd3lms/
#HackerNews #BlockDiffusion #Autoregressive #LanguageModels #DiffusionModels #MachineLearning #AIResearch
h4ckernews Bot , 10 months ago to random Phi-4 Reasoning Models https://azure.microsoft.com/en-us/blog/one-year-of-phi-small-language-models-making-big-leaps-in-ai/ #HackerNews #Phi4ReasoningModels #AIInnovation #LanguageModels #MachineLearning #BigLeaps
Phi-4 Reasoning Models
https://azure.microsoft.com/en-us/blog/one-year-of-phi-small-language-models-making-big-leaps-in-ai/
#HackerNews #Phi4ReasoningModels #AIInnovation #LanguageModels #MachineLearning #BigLeaps