Implementation of Perceiver AR, DeepMind's long-context attention network based on the Perceiver architecture, in PyTorch
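For orientation, a minimal single-head sketch of the core Perceiver AR idea — only the last `n_latents` positions act as queries, but they attend causally over the whole sequence, so cost scales with `n_latents * seq_len` rather than `seq_len ** 2`. All names here are illustrative, not the repo's API:

```python
import torch
import torch.nn.functional as F

def perceiver_ar_attention(x, n_latents, wq, wk, wv):
    """Single-head sketch: queries come only from the latent (last)
    positions, keys/values from the full sequence, with causal masking."""
    seq_len, dim = x.shape
    q = x[-n_latents:] @ wq          # queries: latent positions only
    k = x @ wk                       # keys/values: whole sequence
    v = x @ wv
    scores = q @ k.t() / dim ** 0.5  # (n_latents, seq_len)
    # causal mask: latent i sits at global position seq_len - n_latents + i
    pos_q = torch.arange(seq_len - n_latents, seq_len).unsqueeze(1)
    pos_k = torch.arange(seq_len).unsqueeze(0)
    scores = scores.masked_fill(pos_k > pos_q, float("-inf"))
    return F.softmax(scores, dim=-1) @ v
```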
My own attempt at a long-context genomics model, leveraging recent advances in long-context attention modeling (FlashAttention plus other hierarchical methods)
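As an aside on the FlashAttention ingredient: in recent PyTorch the fused kernel is reachable directly through `scaled_dot_product_attention` (the `sdpa_kernel` context manager needs PyTorch >= 2.3 and a supported GPU; shapes below are illustrative):

```python
import torch
import torch.nn.functional as F
from torch.nn.attention import SDPBackend, sdpa_kernel  # PyTorch >= 2.3

# Illustrative shapes: (batch, heads, seq_len, head_dim). A long genomic
# sequence stays tractable because the fused kernel never materializes
# the (seq_len x seq_len) attention matrix.
q = torch.randn(1, 8, 65536, 64, device="cuda", dtype=torch.float16)
k, v = torch.randn_like(q), torch.randn_like(q)

with sdpa_kernel(SDPBackend.FLASH_ATTENTION):  # force the flash backend
    out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
```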
RAN: Recurrent Attention Networks for Long-Text Modeling | Findings of ACL 2023
LongQLoRA: Extend Context Length of LLMs Efficiently
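LongQLoRA builds on position interpolation (combined with QLoRA finetuning). A sketch of that ingredient alone, with a hypothetical `rope_frequencies` helper rather than the repo's API:

```python
import torch

def rope_frequencies(dim, positions, base=10000.0, scale=1.0):
    """Rotary embedding angles with position interpolation: dividing
    positions by `scale` squeezes a longer context into the position
    range the model was pretrained on."""
    inv_freq = 1.0 / base ** (torch.arange(0, dim, 2).float() / dim)
    angles = (positions.float() / scale).unsqueeze(-1) * inv_freq
    return angles.cos(), angles.sin()

# extend a model trained at 4k to 16k: interpolate by a factor of 4
cos, sin = rope_frequencies(64, torch.arange(16384), scale=4.0)
```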
[DEPRECATED] Very fast, large music transformer with 8k sequence length, efficient heptabit MIDI note encoding, true full MIDI instrument range, chord counters, and outro tokens
Streamlined variant of Long-Range Arena with pinned dependencies, automated data downloads, and deterministic shuffling.
Implementation of Recurrent Memory Transformer (NeurIPS 2022 paper) in PyTorch
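A toy loop over segments capturing the recurrent-memory idea: memory tokens are concatenated to each segment and their transformed states carry over to the next segment. The paper's causal variant uses separate read and write memory tokens; this bidirectional sketch folds them together, and the module is hypothetical:

```python
import torch
import torch.nn as nn

class RecurrentMemorySketch(nn.Module):
    """Memory tokens prepended to each segment; their output states
    become the recurrent state fed into the next segment."""
    def __init__(self, dim=256, n_mem=8, depth=2):
        super().__init__()
        layer = nn.TransformerEncoderLayer(dim, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, depth)
        self.memory = nn.Parameter(torch.randn(1, n_mem, dim))
        self.n_mem = n_mem

    def forward(self, segments):  # list of (batch, seg_len, dim) tensors
        mem = self.memory.expand(segments[0].size(0), -1, -1)
        outs = []
        for seg in segments:
            h = self.encoder(torch.cat([mem, seg], dim=1))
            mem, out = h[:, :self.n_mem], h[:, self.n_mem:]
            outs.append(out)
        return torch.cat(outs, dim=1), mem
```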
LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
ACL 2024 | LooGLE: Long-Context Evaluation for Long-Context Language Models
Finetuning and evaluating LLMs to extract GHG emissions from PDF reports using RAG and grammar-based decoding.
Implementation of paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
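LM-Infinite's Λ-shaped mask is simple enough to sketch directly: each position attends to a few global tokens at the start of the sequence plus a causal local window. This omits the paper's distance-ceiling trick for position encodings, and the defaults are illustrative:

```python
import torch

def lambda_mask(seq_len, n_global=4, window=2048):
    """Λ-shaped attention mask: keep the first `n_global` keys plus a
    causal local window, so attention cost stays linear at generation
    time. Returns True where attention is allowed."""
    i = torch.arange(seq_len).unsqueeze(1)   # query positions
    j = torch.arange(seq_len).unsqueeze(0)   # key positions
    causal = j <= i
    keep = (j < n_global) | (i - j < window)
    return causal & keep
```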
Needle-in-a-haystack testing for LLMs
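The setup is small enough to sketch end to end: bury one target fact at a controlled depth inside long filler text and check whether the model retrieves it. The filler sentences, needle, and question below are placeholders:

```python
import random

def build_haystack(needle, filler_sentences, total_sents=2000, depth=0.5):
    """Insert the needle `depth` of the way through the filler context,
    then append a retrieval question."""
    sents = [random.choice(filler_sentences) for _ in range(total_sents)]
    sents.insert(int(total_sents * depth), needle)
    context = " ".join(sents)
    question = "What is the magic number mentioned in the text?"
    return f"{context}\n\n{question}"

prompt = build_haystack(
    "The magic number is 48731.",
    ["The sky was clear that day.", "Traffic moved slowly downtown."],
)
```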
The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"
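A simplified sketch of InfLLM's block-retrieval idea: distant keys are grouped into blocks, each summarized by a representative, and the current query attends only to the top-k most relevant blocks. The repo scores selected representative tokens rather than the mean key used here, and all names are hypothetical:

```python
import torch

def select_memory_blocks(q, past_k, block=128, topk=4):
    """Training-free memory lookup: summarize each block of past keys
    by its mean, score blocks against the query, return the keys of
    the top-k blocks for attention."""
    n_blocks = past_k.size(0) // block
    blocks = past_k[: n_blocks * block].view(n_blocks, block, -1)
    reps = blocks.mean(dim=1)                 # (n_blocks, dim)
    scores = reps @ q                         # relevance per block
    idx = scores.topk(min(topk, n_blocks)).indices
    return blocks[idx].reshape(-1, past_k.size(-1))  # selected keys
```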
The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"
LongAlign: A Recipe for Long Context Alignment Encompassing Data, Training, and Evaluation
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in PyTorch
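A toy two-scale stack in the MEGABYTE spirit: a global model runs over patch embeddings (seq_len / patch positions) and a small local model predicts the bytes inside each patch. Causal masking and the paper's patch-offsetting trick are omitted for brevity, and the module is hypothetical:

```python
import torch
import torch.nn as nn

class MegabyteSketch(nn.Module):
    """Global model over concatenated-byte patch embeddings, local
    model over the bytes within each patch."""
    def __init__(self, dim=256, patch=8, vocab=256):
        super().__init__()
        self.patch = patch
        self.byte_emb = nn.Embedding(vocab, dim)
        g_layer = nn.TransformerEncoderLayer(dim * patch, 8, batch_first=True)
        l_layer = nn.TransformerEncoderLayer(dim, 4, batch_first=True)
        self.global_model = nn.TransformerEncoder(g_layer, 2)
        self.local_model = nn.TransformerEncoder(l_layer, 2)
        self.to_logits = nn.Linear(dim, vocab)

    def forward(self, bytes_in):  # (batch, seq), seq % patch == 0
        b, n = bytes_in.shape
        x = self.byte_emb(bytes_in)                      # (b, n, dim)
        patches = x.view(b, n // self.patch, -1)         # bytes -> patches
        ctx = self.global_model(patches).view(b, n, -1)  # back to bytes
        h = self.local_model((x + ctx).view(b * n // self.patch, self.patch, -1))
        return self.to_logits(h).view(b, n, -1)
```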
PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)
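The compressive-memory update at the heart of Infini-attention is compact enough to sketch. This shows the paper's linear update variant (sigma = elu + 1, linear-attention style); the learned gate that blends memory output with local attention is omitted:

```python
import torch
import torch.nn.functional as F

def infini_memory_step(q, k, v, mem, z):
    """One segment: retrieve from the running memory with the current
    queries, then fold this segment's keys/values into it.
    Shapes: q, k (seg, dim_k); v (seg, dim_v); mem (dim_k, dim_v);
    z (dim_k, 1), the normalizer. Initialize mem and z to zeros."""
    sq, sk = F.elu(q) + 1, F.elu(k) + 1
    retrieved = (sq @ mem) / (sq @ z).clamp(min=1e-6)  # (seg, dim_v)
    mem = mem + sk.t() @ v                             # accumulate KV
    z = z + sk.sum(dim=0, keepdim=True).t()            # accumulate keys
    return retrieved, mem, z
```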
"Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding" Zhenyu Zhang, Runjin Chen, Shiwei Liu, Zhewei Yao, Olatunji Ruwase, Beidi Chen, Xiaoxia Wu, Zhangyang Wang.
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
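For reference, a naive sequential version of Mamba's selective scan (the real implementation fuses this recurrence into a parallel kernel; this sketch drops the batch dimension):

```python
import torch

def selective_scan_ref(x, dt, A, B, C):
    """h_t = exp(dt_t * A) * h_{t-1} + dt_t * B_t * x_t;  y_t = C_t . h_t.
    Selectivity: B, C, dt are input-dependent.
    Shapes: x, dt (L, d_inner); A (d_inner, d_state); B, C (L, d_state)."""
    seq_len, d_inner = x.shape
    d_state = A.shape[1]
    h = torch.zeros(d_inner, d_state)
    ys = []
    for t in range(seq_len):
        dA = torch.exp(dt[t].unsqueeze(-1) * A)               # discretized A
        dBx = dt[t].unsqueeze(-1) * B[t] * x[t].unsqueeze(-1)  # input term
        h = dA * h + dBx
        ys.append((h * C[t]).sum(-1))                          # (d_inner,)
    return torch.stack(ys)
```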