Skip to main content
Filter by
Sorted by
Tagged with
Advice
0 votes
1 replies
38 views

We’re working on a problem where our master dataset contains names of organizations and individuals, but some entries are untagged. We only have the names (no additional details such as email or ...
MJ17's user avatar
  • 109
1 vote
1 answer
65 views

I'm using SpaCy to process a series of sentences and return the five most common words in each sentence. My goal is to store the output of that frequency analysis (using Counter) in a column beside ...
cmr's user avatar
  • 23
0 votes
0 answers
51 views

I'm taking a stab at building my own claim extraction pipeline (first time spaCy user). Upstream in my pipeline, I feed n amount of docs to NER in the en_core_web_sm pretrained model in order to ...
muelikath_8's user avatar
0 votes
0 answers
55 views

I am having the hardest of times getting my training session to use my gpu 0 which by every measure is present and correctly setup with cuda 12.2. When I try to do python -m spacy train base_config....
demongolem's user avatar
  • 9,754
1 vote
1 answer
139 views

I’m working on anonymizing PII in banking text using Microsoft Presidio . The built-in PERSON recognizer (which uses spaCy under the hood) works for some Western names and when the sentence is clear ...
karthik's user avatar
  • 11
2 votes
1 answer
77 views

I'm working on a project where I'm attempting to extract medical symptoms from a running transcription. I'm using SocketIO to get mic audio and then using Whisper to transcribe the audio into text ...
saksham shankar's user avatar
1 vote
2 answers
90 views

I am having 200 documents and I want to do NER and pos_tagging. However I find spacy to be too slow(I am running this code in google colab): for doc in nlp.pipe(dataset["text"], batch_size=...
Kishan Kumar's user avatar
0 votes
0 answers
62 views

I am traing to train a spaCy v 3.8.7 spancat model on ~100 sustainability reports (annotated with Prodigy) to extract KPIs and targets. An NER pipeline trained on the same data reaches F≈0.40, but ...
Maryna Gulenko's user avatar
0 votes
1 answer
175 views

Having created a new venv I am attempting to install spacy strictly in accordance with the documentation Specifically: pip install -U pip setuptools wheel pip install -U 'spacy[apple]' This fails (...
jackal's user avatar
  • 29.1k
-2 votes
2 answers
512 views

I'm new to Python and I was given this code by my professor which includes "import spacy" and when I run the code I get the line: ModuleNotFoundError: No module named 'spacy' That's where I ...
NextGen AI's user avatar
0 votes
0 answers
23 views

How can I extract a single noun that is the head of multiple children? I'm facing an issue in dependency matching in spaCy. I want to extract the nouns describing the name entities (identified by ...
Emma's user avatar
  • 1
4 votes
0 answers
62 views

The latest spaCy versions have better performance and compatibility for GPU acceleration on Apple devices, but I have an existing project that depends on spaCy 3.1.4 and some of the specific behavior ...
synchronizer's user avatar
  • 2,115
0 votes
0 answers
32 views

While using spacy transformer pipeline en_core_web_trf. How to retrieve the transformer tokenization (often roberta-base), it can be the tokenizer ids, tokenizer strings, or both (preferably). Actual ...
Lin's user avatar
  • 1,211
0 votes
1 answer
313 views

Currently I'm using spacy-layout as part of a pipeline to OCR documents and analyse documents. However, I also need to access other features of Docling such as counting the number of images in each ...
Virtual Architectures's user avatar
0 votes
1 answer
163 views

Hi Im trying to install Spacy on my win 11 laptop. I have python (3+) and pip (latest) already installed. However when I run the install command as indicated on the website - pip install -U spacy the ...
Sachin Shanbhag's user avatar
1 vote
1 answer
207 views

I get the following error while installing the spacy library in Python 3.13.0. The pip version is 25.0.1. Can someone help? Thank you. (I made sure to install numpy, scipy, preshed,Pyrebase4 based on ...
user17144's user avatar
  • 458
0 votes
1 answer
476 views

I'm using spaCy with the pl_core_news_lg model to extract named entities from Polish text. It correctly detects both organizations (ORG) and people's names (PER): import spacy nlp = spacy.load("...
Maltion's user avatar
  • 89
0 votes
1 answer
70 views

Dataset structureCan somebody help me with the NER model in converting the data into spacy format. The dataset format is shown in the screenshot here (https://www.kaggle.com/datasets/naseralqaydeh/...
Rohit Gupta's user avatar
2 votes
0 answers
76 views

I'm working on normalizing ingredient names from a recipe dataset using Python and spaCy. My goal is to extract only the relevant ingredients and ignore measurement units, fractions, and other ...
Островська Катя's user avatar
0 votes
2 answers
282 views

I wanted to use spacy to work on a project, but it cannot be installed using pip and is showing the following error message in command prompt pip install spacy Collecting spacy Using cached spacy-3....
Ronal George's user avatar
0 votes
1 answer
69 views

I'm trying to write a spacy parser to extract the names and terms of a contract. To do that, I've written a rule to extract the sellers and buyers, except it's extracting multiple times over a simple ...
kernel density's user avatar
4 votes
2 answers
377 views

I am using presidio/langchain_experimental to anonymize text in Polish, but it does not detect names (e.g., "Jan Kowalski"). Here is my code: from presidio_anonymizer import ...
Maltion's user avatar
  • 89
0 votes
1 answer
115 views

Sometimes, a Python library depends on additional data, such as ML models. This could be a model from transformers, spacy, nltkand so on. Typically there is a command to download such a model: python -...
jdm's user avatar
  • 10.3k
0 votes
0 answers
50 views

I'm working on a relation extraction model task using a transformer-based model. the `pipeline is expected to extract entity pairs along with their labelled relation labels. When I run the evaluation ...
user29424767's user avatar
0 votes
1 answer
42 views

I would like to know which version of Rasa is compatible with which version of spaCy. I tried to create a bot with Rasa==3.5.10, Spacy==3.2.4 but couldn't. I tried to use another version of spacy and ...
kakiutidev's user avatar
2 votes
1 answer
54 views

I'm using spacy for some downstream tasks, mainly noun phrase extraction. My texts contain a lot of parentheses, and while applying the lemma, I noticed all the punctuation that doesn't end sentences ...
MERose's user avatar
  • 4,481
3 votes
1 answer
187 views

I want to run an entity linking job using a custom Knowledgebase alone, and not use the second step ML re-ranker that requires a training dataset / Spacy corpus. I want the NEL pipeline to only assign ...
lrthistlethwaite's user avatar
1 vote
1 answer
76 views

I have a series of product reviews from multiple websites and am trying to identify reviews that are potentially duplicates (ie very similar in the words used). I know there's a lot of room for ...
timpone's user avatar
  • 20.1k
2 votes
1 answer
74 views

I'm using spaCy to extract and identify entity types (like ORG, GPE, DATE, etc.) from a text description. However, I am noticing some incorrect results, and I'm unsure how to fix this. Here is the ...
PrakashT's user avatar
  • 931
2 votes
4 answers
2k views

I am using trying to install spacy library using 'pip install -U spacy' in the command prompt (run as admin) in Windows-11 O.S., but it shows some error I don't understand. I am using Python 3.13.0, ...
RohanVarma Ragula's user avatar
0 votes
0 answers
255 views

I tried installing spacy but got the following error message: blis 1.0.1 has requirement numpy<3.0.0,>=2.0.0, but you have numpy 1.23.5. thinc 8.3.2 has requirement numpy<2.1.0,>=2.0.0; ...
MUHAMMAD KASHIF's user avatar
1 vote
0 answers
86 views

I am getting into troubles as I try to install spaCy in a Fedora 41, AMD Ryzen machine. I got this: Preparing metadata (pyproject.toml): finished with status 'error' error: subprocess-exited-with-...
hjmnzs's user avatar
  • 141
1 vote
1 answer
203 views

Given a generic text sentence (in a specific context) how can I extract word/entities of interest belonging to a specific "category" using python and any NLP library? For example given a ...
Riccardo Raffini's user avatar
1 vote
1 answer
82 views

Hi I am trying to create a chatbot using chatterbot, any ideas on what I should do regarding the error? my code: ///this is to just train the bot/// from chatterbot import ChatBot from chatterbot....
majid's user avatar
  • 11
2 votes
1 answer
62 views

I am working on linking short texts to entities in a biomedical knowledge graph (UMLS CUIs) using SciSpacy for a research project. The goal is to analyze the relationship between the linked entity and ...
GrimSqueaker's user avatar
3 votes
2 answers
141 views

I'm working on a multiprocessing python application where multiple processes need access to a large, pre-loaded spaCy NLP model (e.g., en_core_web_lg). Since the model is memory-intensive, I want to ...
Héctor Balsells Roure's user avatar
4 votes
1 answer
259 views

I want to use Spacy to generate embeddings of text stored in a polars DataFrame and store the results in the same DataFrame. Next, I want to save this DataFrame to the disk and be able to load again ...
HappyHippo's user avatar
0 votes
1 answer
649 views

I'm trying to follow a simple example from spacy universe layers page, but this is failing for me: Code Implementation: # template.yaml file AWSTemplateFormatVersion: "2010-09-09" Transform:...
Willian's user avatar
  • 3,475
-1 votes
2 answers
143 views

I have a pandas data frame with a column of text values (documents). I want to apply lemmatization on these values with the spaCy library using the pandas apply function. I've defined my to_lemma ...
Patrick's user avatar
  • 2,346
5 votes
4 answers
8k views

trying to install Spacy using pip install -U spacy, but getting the following error message: C:\Windows\System32>pip install spacy Collecting spacy Using cached spacy-3.8.2.tar.gz (1.3 MB) ...
Mohammed Rasheed's user avatar
1 vote
0 answers
228 views

I am trying to run spacy-llm with my organization's azure openai api that they have provided me. But since I am pretty beginner in Python, I guess I am not writing the config file correctly, and hence ...
Virad Gupta's user avatar
0 votes
1 answer
215 views

I am working in a text that contains biomedical entities. However medspacy package failed to detect those: import medspacy nlp = medspacy.load() text = "The patient was treated with warfarin ...
user3104352's user avatar
  • 1,130
1 vote
1 answer
133 views

I'm working with a Polars DataFrame and I want to remove stopwords from a specific column using SpaCy with GPU support. I have the following setup: import polars as pl import spacy # Load SpaCy with ...
Naman Kumar Muktha's user avatar
2 votes
1 answer
392 views

I 'am trying to find similar vector with spacy and numpy. I found the code following url : Mapping word vector to the most similar/closest word using spaCy But I'm getting type error import numpy as ...
onder's user avatar
  • 857
1 vote
1 answer
47 views

I've downloaded the UD Treebank dataset, set up a shell script to discover all folders for a given language and converted the .conllu files to .spacy. Now I have a collection of files like this: ...
lhk's user avatar
  • 30.7k
1 vote
0 answers
35 views

nlp= spacy.load(en_core_web_lg-3.7.1) name = 'Las Palmas Mexican Restaurant & Bar' doc = nlp(name) for token in doc: print(f"{token.text} \t{token.ent_type_} \t{token.ent_iob_}&...
Saeed's user avatar
  • 2,151
0 votes
0 answers
95 views

I am trying to install a spaCy model: pip install https://github.com/explosion/spacy-models/releases/download/en_core_web_sm-3.0.0/en_core_web_sm-3.0.0.tar.gz and getting the following error: error: ...
cdd's user avatar
  • 515
2 votes
0 answers
139 views

I'm trying to process a CV-like text, more exactly to split it into parts by their meaning (Description, Contacts, Experience, Education, Certifications etc). Would NER be suitable for this purpose (...
Andrei Yusupau's user avatar
0 votes
1 answer
52 views

I am trying to extract only the first speaker's name from a list of texts using spaCy. Currently, my function returns all "PER" tags, but I want to reduce the overhead and get only the first ...
fiskdill's user avatar
-1 votes
1 answer
51 views

OSError: [E050] Can't find model 'en_core_web_sm'. It doesn't seem to be a Python package or a valid path to data directory. this is the error i am getting MY code: from chatterbot import ChatBot from ...
Kritik Munot's user avatar

1
2 3 4 5
76