Text Classification
Transformers
PyTorch
French
camembert
twitter
DistilCamemBERT
sentiment analysis
emojis
text-embeddings-inference
Instructions to use Jessy3ric/camembert-twitter-emoji with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Jessy3ric/camembert-twitter-emoji with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-classification", model="Jessy3ric/camembert-twitter-emoji")# Load model directly from transformers import AutoTokenizer, AutoModelForSequenceClassification tokenizer = AutoTokenizer.from_pretrained("Jessy3ric/camembert-twitter-emoji") model = AutoModelForSequenceClassification.from_pretrained("Jessy3ric/camembert-twitter-emoji") - Notebooks
- Google Colab
- Kaggle
CamemBERT-base for sentiment analysis on tweets
This is a Camembert-base model trained on a corpus of 50K french tweets.
- Git Repo containing the dataset and the code (scraping & training) : Git
The model can predict which of the 25 emojis it has been trained with suits the best on a given sentence / tweet. The 25 emojis are the 25 most frequent in the dataset.
We've succeeded to obtain a 32% accuracy on a small amount of tweets.
Note: We've also decided to keep the emojis in their demojized versions because some emojis could be seen as two (ex : 👍🏿)
Loading the model
from transformers import AutoModelForSequenceClassification, AutoTokenizer, TFAutoModelForSequenceClassification
MODEL = f"Jessy3ric/camembert-twitter-emoji"
tokenizer = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForSequenceClassification.from_pretrained(MODEL)
model.save_pretrained(MODEL)
- Downloads last month
- 4