Named Entity Recognition (NER) in Python

What is Named Entity Recognition?

Imagine walking into a library filled with thousands of books, but you’re not looking for just any book—you need specific information, like the name of an author or the date a book was published. This is exactly what Named Entity Recognition (NER) does, but instead of physical books, it works with text data. It picks out specific pieces of information—whether it’s names of people, dates, organizations, or even locations—from vast amounts of unstructured text.

In simple terms, NER is a technique in Natural Language Processing (NLP) that allows machines to automatically identify and classify entities. Think of it as a highlighter that goes through a sea of text and marks important words or phrases, such as “Google” as an organization or “January 2024” as a date.

Why does this matter? Let me tell you why. We live in an age where data is exploding in every industry, and much of this data is unstructured. Emails, articles, reports, social media posts—you name it. NER enables us to extract actionable insights from this data.

For instance, in search engines, NER helps identify what you’re actually asking about. If you search for “Tesla earnings report 2023,” the system knows to look for a company (Tesla) and a specific event (earnings report) tied to a year (2023). Similarly, chatbots use NER to understand what specific entities you’re talking about, whether it’s a product name or a location. This makes interactions more personalized and context-aware.

So, what’s the goal here? By the end of this blog, you’ll know how to implement NER using Python, giving you the ability to build your own tools that extract valuable insights from unstructured data.

Why is NER Important?

You might be wondering, “Okay, so NER sounds cool, but where does it actually make a difference?” Here’s the deal: NER isn’t just another fancy NLP tool—it’s a game-changer across various industries.

Let’s start with healthcare. Doctors and researchers work with mountains of clinical reports, drug trial data, and patient records. With NER, you can automatically extract critical entities like drug names, symptoms, and diagnoses, speeding up the research process and even aiding in drug discovery.

Now, if we move over to finance, it’s all about precision and speed. NER helps banks and investment firms sift through financial reports, news articles, and social media posts to find relevant information like stock tickers, key events (like mergers or acquisitions), and important dates. Imagine being able to extract all mentions of “Apple Inc.” and “quarterly earnings” from thousands of documents in a matter of seconds. That’s what NER brings to the table.

In e-commerce, NER is behind the scenes powering product recommendation engines and improving customer service chatbots. When someone asks, “Do you have that red dress in size medium?” NER helps the chatbot focus on the entities “red dress” and “size medium,” enhancing the shopping experience.

NER also plays a big role in sentiment analysis. Let’s say you’re analyzing customer feedback. You don’t just want to know if a customer is happy or upset—you want to know who or what they’re happy or upset about. That’s where NER steps in. It picks out names of products or services so you can identify trends and sentiment toward specific entities.

How Named Entity Recognition Works

Overview of NER Models

Now, let’s dig a little deeper. You might be curious: How does NER actually work under the hood? It turns out there’s more than one way to slice this pie, and understanding the different approaches will help you choose the right tool for the job.

There are three main types of models used for NER:

Rule-based models: These are like the old-fashioned librarians—meticulous but rigid. They work with predefined rules and patterns, like regular expressions, to find entities. For example, you could write a rule that anything following “Mr.” or “Ms.” is a person’s name. The downside? These systems struggle with flexibility. They don’t handle variations or nuances in language well.
Statistical models: These models use probability to predict which words are entities based on their context. One popular statistical model is Conditional Random Fields (CRF). It looks at the relationships between words in a sentence and predicts the label for each word. CRF models are more adaptable than rule-based ones but can be complex to set up and require labeled training data.
Machine Learning-based models: This is where things start to get interesting. Modern NER systems rely heavily on machine learning, especially with the rise of deep learning. BiLSTM-CRF models, for example, combine the best of both worlds by using deep learning to capture the context of words and CRFs to label them more accurately. Transformers like BERT (Bidirectional Encoder Representations from Transformers) take this a step further by considering the entire sentence context before predicting entities, which makes them incredibly accurate.

Here’s a simple way to think about it: rule-based models are like having a strict teacher who follows a textbook to the letter, statistical models are more like guessing based on previous experience, and machine learning models are like a savvy detective who looks at every clue in the broader context before drawing a conclusion.

NER Challenges

But wait, it’s not all smooth sailing. NER, despite its capabilities, has some challenges you need to be aware of.

Overlapping entities: Imagine trying to extract entities from the sentence, “Apple is located in Cupertino.” Apple could be a fruit or a tech company, depending on the context. NER models need to be context-aware to resolve these overlaps.

Entity disambiguation: This might surprise you, but one of the biggest challenges in NER is deciding what an entity actually refers to. For example, “Jordan” could refer to a person, a country, or even a basketball shoe brand. Handling these ambiguities requires models to be fine-tuned or supplemented with knowledge bases.

Domain-specific terms: Off-the-shelf NER models may perform poorly when applied to niche domains. For example, a medical NER model might miss an entity like “COVID-19” if it hasn’t been trained on relevant data. This is where fine-tuning comes in—you can fine-tune pre-trained models on domain-specific datasets to improve accuracy.

Context sensitivity: In sentences like “He went to Paris in May,” “Paris” could be a city or a person. Pre-trained models, like spaCy or BERT, can handle this to some extent, but you may need to fine-tune these models for high-stakes tasks.

By understanding these challenges, you can better decide whether to use a pre-trained model or invest the time in fine-tuning your own NER model for specific tasks.

Setting Up the Environment

Now that you understand the basics of NER, let’s roll up our sleeves and get our hands dirty. Before we dive into the code, you’ll need to make sure your environment is set up properly. After all, as the saying goes, “You can’t build a house without the right tools.” Here’s what you’ll need to get started with NER in Python.

Installing Required Libraries

To make things easier, Python has several powerful libraries that handle the heavy lifting of NER for you. You don’t have to reinvent the wheel—just install the right packages. Let’s break down the key libraries you’ll want in your toolkit:

spaCy – This is the go-to library for NER in Python. It’s fast, easy to use, and comes with pre-trained models. Whether you’re working on a small-scale project or building something more complex, spaCy will have you covered.

pip install spacy

2. nltk (Natural Language Toolkit) – While NER is not the primary focus of nltk, it does offer basic NER capabilities. It’s a good starting point for understanding traditional NLP tasks.

pip install nltk

3. Transformers – This library from Hugging Face is for those who want to take their NER game to the next level. It provides state-of-the-art models like BERT, GPT, and RoBERTa, which can dramatically improve NER performance in complex tasks.

pip install transformers

4. Flair – If you’re looking for something that combines simplicity with powerful neural network-based NER models, Flair is an excellent choice. It’s particularly good if you want to work with multiple languages or require advanced features.

pip install flair

With just a few commands, you’ll have all the libraries you need to start recognizing entities in text like a pro. And the best part? You don’t need to be a machine learning expert to get started.

Overview of spaCy for NER

You might be asking, “Why spaCy? Why not stick with something like nltk?” Here’s the deal: spaCy is widely considered the preferred library for NER, and for good reasons:

Speed: spaCy is blazing fast. Whether you’re processing thousands of documents or just a few, it’s designed for performance.
Accuracy: It comes with pre-trained models that have been carefully optimized for various languages and tasks, offering great out-of-the-box accuracy.
Ease of use: spaCy’s API is incredibly intuitive. Even if you’ve never worked with NLP before, you’ll find it easy to load a model and start extracting entities in no time.

spaCy also supports multiple languages, and it provides pre-trained models that can be easily downloaded. For English, you might use en_core_web_sm, a lightweight model that balances speed with accuracy. But if you need more precision, you can opt for the larger en_core_web_lg model, which comes with better word vectors and a deeper understanding of language context.

Named Entity Recognition using spaCy

Now that you’ve got your environment ready, let’s jump into the fun part: actually performing Named Entity Recognition with spaCy.

Loading Pre-trained Models

spaCy makes it ridiculously easy to get started. All you need is a pre-trained model, and you’ll be extracting entities in just a few lines of code. Let me walk you through it:

import spacy

# Load the pre-trained model
nlp = spacy.load('en_core_web_sm')

# Apply the model to a sample text
doc = nlp("Apple is looking at buying U.K. startup for $1 billion")

# Loop through the identified entities
for ent in doc.ents:
    print(ent.text, ent.label_)

Let’s break this down. First, we load the English small model en_core_web_sm. Then, we apply it to a sentence that mentions Apple (the company), the U.K. (a location), and $1 billion (a monetary value). spaCy identifies these entities automatically.

You’ll notice that each entity comes with a label:

ORG (Organization): For companies like “Apple.”
GPE (Geopolitical Entity): For places like “U.K.”
MONEY: For monetary values like “$1 billion.”

These labels help you categorize the entities so you can decide how to use them. Pretty neat, right?

Custom NER using spaCy

But what if you want more control? What if you’re working in a specialized domain like healthcare or law, where the pre-trained models aren’t sufficient? This might surprise you, but spaCy allows you to train your own custom NER model with relative ease.

Let’s say you’re building a healthcare application and want to identify specific medical terms that aren’t in the default spaCy model. Here’s how you’d go about training a custom model:

Prepare Your Data: First, you’ll need annotated data—a set of sentences where you’ve labeled entities like “disease,” “drug,” and “symptom.” This is crucial for training.
Modify the Pre-trained Model: Instead of starting from scratch, you can fine-tune spaCy’s pre-trained model by adding new labels. This is both time-efficient and improves accuracy.
Train the Model: Once your data is ready, you can start the training process. Here’s a simplified version of how you’d fine-tune spaCy’s NER pipeline:

import spacy
from spacy.training import Example

# Load the pre-trained model
nlp = spacy.load("en_core_web_sm")

# Add a new entity label to the NER pipeline
ner = nlp.get_pipe("ner")
ner.add_label("DISEASE")

# Example training data
TRAIN_DATA = [
    ("Covid-19 has spread rapidly", {"entities": [(0, 7, "DISEASE")]}),
    ("The patient was diagnosed with pneumonia", {"entities": [(28, 36, "DISEASE")]}),
]

# Fine-tuning process
optimizer = nlp.resume_training()
for i in range(10):
    for text, annotations in TRAIN_DATA:
        doc = nlp.make_doc(text)
        example = Example.from_dict(doc, annotations)
        nlp.update([example], sgd=optimizer)

print("Model fine-tuned successfully!")

Named Entity Recognition using Transformers (Hugging Face)

Why Use Transformers for NER?

You might be wondering, “Why should I use transformer-based models when libraries like spaCy already do a great job with NER?” Well, here’s the deal: transformers have revolutionized NLP in recent years, and for a good reason. They provide an unprecedented level of contextual understanding.

Unlike traditional NER models that only look at the immediate neighborhood of words, transformer models like BERT (Bidirectional Encoder Representations from Transformers) understand the entire sentence at once—both the words before and after the target entity. This bidirectional nature allows transformers to truly capture the context in which a word is used, making them more accurate in cases where meaning is heavily dependent on sentence structure.

For instance, consider the sentence: “Jordan is a fantastic player.” Is Jordan a country or a person? Transformers use the broader context to understand that in this case, “Jordan” likely refers to a person, maybe even the basketball legend Michael Jordan. This ability to handle such nuances is what makes transformers so powerful for NER.

Another advantage of transformers is their multilingual capability. While older models might need separate training for each language, transformer models like mBERT (multilingual BERT) can perform NER across different languages with the same model. So whether you’re dealing with English, French, or Chinese, transformer-based NER models have got you covered.

Now, let’s take a look at some of the most popular transformer models for NER:

BERT: Known for its strong contextual understanding, BERT can handle everything from simple to highly nuanced text data.
RoBERTa: An improved version of BERT with better performance due to optimized training techniques.
DistilBERT: A lightweight version of BERT that maintains much of its accuracy while being faster and more efficient, making it a great choice when computational resources are limited.

NER with Hugging Face Transformers

So, now that you know why transformers are awesome, let’s get into the practical part. Thanks to Hugging Face’s transformers library, using these models for NER is easier than ever. You don’t need to build complex neural networks from scratch—the heavy lifting is done for you.

Here’s a quick example of how to use a pre-trained transformer model for NER:

from transformers import pipeline

# Create a pipeline for NER
ner_pipeline = pipeline("ner")

# Example text for NER
entities = ner_pipeline("Apple is looking at buying U.K. startup for $1 billion")

# Output the entities detected
for entity in entities:
    print(entity)

What’s happening here? First, we create an NER pipeline using Hugging Face’s pipeline function. This pipeline will automatically select a pre-trained model (usually BERT by default) and use it to identify entities in the text. In this case, it’ll detect “Apple” as an organization, “U.K.” as a location, and “$1 billion” as a monetary amount.

You’ll notice that you didn’t have to worry about loading models, tokenizing text, or applying any advanced techniques. Hugging Face has abstracted away all that complexity, allowing you to focus on the results.

Fine-tuning BERT for NER

Now, if you’re working on a specific domain—let’s say legal documents or medical records—out-of-the-box NER models might not give you the precision you need. That’s where fine-tuning comes in. When you fine-tune a transformer model like BERT, you essentially take a pre-trained model and specialize it for your domain using custom data.

Fine-tuning a model for NER involves the following steps:

Collect domain-specific labeled data: For example, you can use datasets like CoNLL-2003, which is commonly used for NER tasks, or create your own dataset with domain-specific entities.
Preprocess the data: You need to convert your data into a format that the model can understand (usually tokenized text with corresponding entity labels).
Train the model: Use the dataset to adjust the weights of the pre-trained model, allowing it to recognize new entities.

Here’s a simplified example of how to fine-tune BERT for a custom NER task:

from transformers import BertForTokenClassification, Trainer, TrainingArguments
from transformers import BertTokenizer
from datasets import load_dataset

# Load a pre-trained BERT model and tokenizer
model = BertForTokenClassification.from_pretrained("bert-base-cased", num_labels=9)
tokenizer = BertTokenizer.from_pretrained("bert-base-cased")

# Load the CoNLL-2003 dataset (or use your custom dataset)
dataset = load_dataset("conll2003")

# Define training arguments
training_args = TrainingArguments(
    output_dir='./results', 
    evaluation_strategy="epoch", 
    learning_rate=2e-5, 
    per_device_train_batch_size=16,
    num_train_epochs=3
)

# Initialize the Trainer
trainer = Trainer(
    model=model, 
    args=training_args, 
    train_dataset=dataset['train'],
    eval_dataset=dataset['validation']
)

# Fine-tune the model
trainer.train()

What’s going on here?

We’re using BertForTokenClassification, a version of BERT that is specifically designed for token-level tasks like NER.
We load the CoNLL-2003 dataset, which is a popular dataset for NER tasks. It includes text annotated with entities like PER (person), ORG (organization), and LOC (location).
We define the training arguments, such as the learning rate, batch size, and number of epochs.
Finally, we fine-tune the model by training it on the NER task.

Once trained, this model will perform far better on domain-specific data than a generic pre-trained model, allowing you to extract more accurate entities.

Evaluating NER Models

You’ve fine-tuned your model or used a pre-trained one—now what? It’s time to evaluate how well the model performs.

Evaluation Metrics

When evaluating NER models, we typically focus on three key metrics: precision, recall, and F1-score.

Precision tells you the proportion of identified entities that are correct. If your model says “Apple” is a company, was it actually right?
Recall measures how many of the actual entities your model identified. If there were 10 organizations in a text but your model only caught 8, your recall is 80%.
F1-score combines precision and recall into a single number, giving you a balanced view of your model’s performance.

Here’s how you can compute these metrics using the seqeval library, which is specially designed for sequence labeling tasks like NER:

from seqeval.metrics import classification_report

# Sample true and predicted labels
true_labels = [['O', 'B-ORG', 'O', 'B-GPE', 'O', 'B-MONEY']]
predicted_labels = [['O', 'B-ORG', 'O', 'B-GPE', 'O', 'B-MONEY']]

# Generate evaluation report
print(classification_report(true_labels, predicted_labels))

This simple report will give you the precision, recall, and F1-score for each entity type (e.g., ORG, GPE, MONEY).

Error Analysis

Even the best NER models aren’t perfect. To truly understand where your model is going wrong, you need to perform error analysis. This involves looking at specific examples where your model failed—whether it missed an entity, mislabeled one, or struggled with boundaries.

For instance, your model might label “New York City” as two separate entities: “New York” (GPE) and “City” (O). This is an example of incorrect boundary detection. Understanding these errors allows you to either retrain the model or improve your data quality.

Here’s an example of how you might start your error analysis:

# Example function to analyze misclassified entities
def analyze_errors(true_labels, predicted_labels, tokens):
    for true, pred, token in zip(true_labels, predicted_labels, tokens):
        if true != pred:
            print(f"Token: {token}, True label: {true}, Predicted label: {pred}")

# Sample tokens
tokens = ["Apple", "is", "based", "in", "New", "York", "City"]

# Analyze misclassified tokens
analyze_errors(true_labels[0], predicted_labels[0], tokens)

By carefully analyzing these misclassifications, you’ll have a better understanding of how to improve your NER system, whether through additional training, better data, or adjusting your model’s architecture.

Advanced Techniques

Handling Ambiguity in Entities

Let’s face it, language is messy. Words can have different meanings depending on the context, and this creates a challenge when performing NER. You might be wondering, “How do we handle situations where the same word refers to multiple entities?” For example, “Apple” could mean a fruit or the tech giant, and “Jordan” could refer to a country or a person. This is where entity disambiguation comes into play.

One of the best ways to resolve ambiguity is through entity linking. This technique goes beyond just identifying an entity—it links the identified entity to a unique identifier in a knowledge base (e.g., linking “Apple” to its Wikipedia page). You can think of it as a process that says, “I know what this word means in this context.”

SpaCy offers an entity linking pipeline that connects entities to their respective knowledge bases. It’s a powerful way to resolve ambiguity, especially when working with large datasets filled with entities that have multiple meanings.

Another powerful approach is to use contextual embeddings like BERT or GPT. These models analyze the surrounding words to infer the correct meaning of an entity. For instance, in the sentence “Jordan is a basketball legend,” BERT understands that “Jordan” refers to a person based on the context provided by the words “basketball” and “legend.”

Here’s how you can use OpenAI’s GPT for disambiguation. GPT excels at understanding context:

from transformers import GPT2Tokenizer, GPT2LMHeadModel

# Load GPT-2 model and tokenizer
tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

# Example sentence with ambiguous entity
text = "Jordan is a basketball legend."

# Tokenize and encode the input
inputs = tokenizer.encode(text, return_tensors="pt")

# Generate continuation of the sentence for context understanding
outputs = model.generate(inputs, max_length=50, do_sample=True)

print(tokenizer.decode(outputs[0]))

This isn’t a direct NER implementation but shows how GPT models can help resolve ambiguities based on broader context, making entity linking and disambiguation more accurate.

Multi-Language NER

We live in a multilingual world, and if you’re working on global projects, you’ll want your NER model to handle more than just English. Here’s the deal: Transformer models like mBERT (Multilingual BERT) and XLM-RoBERTa are designed for this very purpose. These models are pre-trained on multiple languages and can recognize entities across various languages with minimal fine-tuning.

Here’s how you can adapt an NER model to handle multiple languages using mBERT:

from transformers import pipeline

# Create an NER pipeline using a multilingual BERT model
multilang_ner_pipeline = pipeline("ner", model="bert-base-multilingual-cased")

# Example non-English text
entities = multilang_ner_pipeline("Apple está considerando comprar una startup en el Reino Unido por 1 mil millones de dólares.")

# Output the entities detected in Spanish
for entity in entities:
    print(entity)

In this example, the model understands that “Apple” refers to an organization and “Reino Unido” refers to a geopolitical entity, all while processing the text in Spanish. This makes mBERT a go-to model when you’re dealing with multilingual datasets.

NER in Low-Resource Languages

Here’s something that might surprise you: Not all languages have the same amount of training data available. While English, Spanish, or Chinese are well-represented in NLP, many low-resource languages don’t have as much data for training models. So how do you perform NER when you have little or no labeled data for a language?

One effective technique is transfer learning. This involves taking a model trained on a high-resource language and fine-tuning it on a smaller dataset in your target low-resource language. Another approach is zero-shot learning, where a model trained on one language is directly applied to another language without any retraining. This is possible with models like mBERT, which understand multiple languages natively.

For low-resource NER, transfer learning often looks like this:

Train your NER model on a language with plenty of data (e.g., English).
Fine-tune that model with whatever data you have in your low-resource language, or even a similar language.

This way, you can still extract valuable entities without having a massive dataset at your disposal.

Custom Dataset for NER

Building and Annotating Your Dataset

Now, let’s switch gears. Imagine you’re working on a niche industry project, say healthcare or law, and the pre-trained NER models just don’t capture the entities you need. What’s your next move? Well, custom NER models require custom datasets, and that’s where data annotation comes in.

You need to manually annotate your data to teach your model which entities to recognize. This can sound daunting, but there are tools like Prodigy and Label Studio that make this process much easier. These tools let you highlight entities in your dataset and assign labels, making it faster to build a large, labeled dataset.

Here’s how to use Prodigy for NER annotation:

Load Your Dataset: Start with a raw text file.
Set Up Labels: Define the custom entities you’re interested in, like “DISEASE” or “DRUG.”
Annotate: Use the Prodigy interface to highlight the entities in your text.

prodigy ner.manual your_dataset en_core_web_sm ./your_data.jsonl --label PERSON,ORG,DISEASE,DRUG

This simple command sets you up to annotate your text with the specified labels. It’s important to balance your dataset by including examples of all your target entities and ensuring that you annotate edge cases like nested entities or overlapping entities. Doing this right will help your model generalize better during training.

Best Practices for Dataset Creation

Balance your dataset: Ensure that your dataset includes a variety of entities and isn’t skewed toward one type.
Handle edge cases: Pay attention to special cases like multi-word entities (e.g., “New York City”) or nested entities (e.g., “The CEO of Apple, Tim Cook”).
Consistent labeling: Maintain consistency in labeling—use a style guide if needed, so everyone annotating the data follows the same rules.

Training a Custom NER Model

Once you’ve annotated your dataset, it’s time to train your custom NER model. Whether you’re using spaCy or Hugging Face transformers, the process is similar to fine-tuning a pre-trained model, except now you’ll be training on your custom entities.

Here’s a simplified walkthrough for training a custom NER model using spaCy:

import spacy
from spacy.training import Example

# Load a pre-trained model
nlp = spacy.blank("en")  # Starting with a blank English model

# Add the NER component
ner = nlp.add_pipe("ner")

# Define your custom entities
ner.add_label("DISEASE")
ner.add_label("DRUG")

# Annotated data
TRAIN_DATA = [
    ("Covid-19 is spreading fast", {"entities": [(0, 8, "DISEASE")]}),
    ("The doctor prescribed aspirin", {"entities": [(24, 31, "DRUG")]}),
]

# Training the model
optimizer = nlp.begin_training()

# Fine-tuning loop
for i in range(10):
    for text, annotations in TRAIN_DATA:
        doc = nlp.make_doc(text)
        example = Example.from_dict(doc, annotations)
        nlp.update([example], sgd=optimizer)

print("Custom NER model trained successfully!")

Hyperparameter Tuning and Model Validation

You might be thinking, “How do I make sure my model performs well?” Well, this is where hyperparameter tuning comes in. You’ll want to experiment with parameters like learning rate, batch size, and number of epochs to strike the right balance between speed and accuracy.

For model validation, always set aside a portion of your dataset for testing. This will give you an unbiased evaluation of your model’s performance and help you spot any overfitting or underfitting.

Finally, when you’re happy with the model’s performance, you can deploy it in your applications—whether it’s a chatbot, search engine, or any other text-based tool.

Conclusion

Congratulations! You’ve made it to the finish line. By now, you should have a solid grasp of how Named Entity Recognition (NER) works, the various tools and techniques available to you, and how to implement NER using Python. Whether you’re using the simplicity of spaCy or leveraging the power of transformer-based models from Hugging Face, you’re well-equipped to tackle entity extraction in any domain or language.

Let’s quickly recap some of the key takeaways:

NER is a critical component of Natural Language Processing, helping machines identify and categorize important entities like people, organizations, and dates from unstructured text.
Tools like spaCy offer speed and ease of use, while transformer-based models like BERT provide superior contextual understanding and multilingual capabilities.
Custom NER models can be built and fine-tuned to suit domain-specific needs, and with the right data and techniques, you can achieve high accuracy in even the most complex tasks.
Handling challenges like entity disambiguation, working in low-resource languages, and creating custom datasets is all part of taking your NER system from good to great.

As you go forward, I encourage you to experiment with the techniques we’ve covered here. Whether you’re working on extracting information from legal contracts, performing sentiment analysis, or building advanced chatbots, NER will be an indispensable tool in your NLP arsenal.

The beauty of NER lies not just in extracting entities but in understanding the context in which those entities exist. And as NLP evolves, so too will the possibilities for more intelligent, context-aware systems.

Remember, data science is all about continuous learning and iteration. Your first model might not be perfect, but with every experiment and tweak, you’ll get closer to a system that can unlock valuable insights from text in ways you never thought possible.

Good luck with your projects, and keep pushing the boundaries of what’s possible with NER!