Neural Network for English-to-Hindi Transliteration

Introduction

This project focuses on building an Encoder-Decoder model with attention for transliteration of English text to Hindi. The goal is to create a neural network model that can accurately convert English words into their corresponding Hindi counterparts. The model is implemented using PyTorch and makes use of the attention mechanism to improve the translation performance.

Dataset

The dataset used for training and evaluation is a collection of English-Hindi word pairs. Each word pair consists of an English word and its transliterated Hindi word. The dataset is prepared specifically for the transliteration task and is available in a suitable format for training the model.

Instructions

Prerequisites

Python 3.9 or higher
PyTorch
NumPy
Matplotlib
scikit-learn

Setup

Clone the repository:

git clone https://github.com/your_username/encoder-decoder-transliteration.git

Change into the project directory:

cd encoder-decoder-transliteration

Install the required dependencies:

pip install -r requirements.txt

Training the Model

To train the Encoder-Decoder model for transliteration, follow these steps:

You may run the train.py help script to view the available options:

python train.py --help

Set the appropriate options for the available params and run the training script.

Monitor the training progress:

During training, the script will display the loss and accuracy metrics for each batch and log them to Weights & Biases (wandb). You can monitor the training progress and visualize the loss and accuracy curves using the generated wandb report. Evaluate the model:

After training, you can evaluate the trained model on a separate validation set or test set using the calc_accuracy function. Modify the evaluation code in the calc_accuracy function to suit your evaluation requirements. Save and use the model:

Once you are satisfied with the training results, you can save the trained model using the torch.save function. The saved model can be loaded and used for transliteration tasks in a separate script or application.

Conclusion

The Encoder-Decoder model with attention implemented in this project provides a solution for transliterating English text to Hindi. By training the model on a suitable dataset and fine-tuning the hyperparameters, accurate transliterations can be obtained. The provided instructions guide you through the process of training the model and using it for transliteration tasks. Feel free to explore and experiment with different settings to improve the model's performance.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
EncoderDecoder		EncoderDecoder
EncoderDecoderWithAttn		EncoderDecoderWithAttn
predictions_attention		predictions_attention
predictions_vanilla		predictions_vanilla
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Neural Network for English-to-Hindi Transliteration

Introduction

Dataset

Instructions

Prerequisites

Setup

Clone the repository:

Training the Model

Monitor the training progress:

Conclusion

About

Releases

Packages

Languages

iamunr4v31/CS6910-Assignment3

Folders and files

Latest commit

History

Repository files navigation

Neural Network for English-to-Hindi Transliteration

Introduction

Dataset

Instructions

Prerequisites

Setup

Clone the repository:

Training the Model

Monitor the training progress:

Conclusion

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages