Skip to main content
Filter by
Sorted by
Tagged with
0 votes
1 answer
242 views

I'm using Python 3.12, and I'm trying to install the latest version (1.15.1) of TensorFlow Data Validation. However, while installing, I encountered the following error: ERROR: Could not find a ...
Ranjith kumar K's user avatar
0 votes
1 answer
99 views

By the following vba excel code which I found on the internet, I can select more than one option within a cell from the dv dropdown list at the same time. What I am trying to do is to add a checkmark ...
PYC's user avatar
  • 23
0 votes
1 answer
249 views

I want to create a new TensorFlow Data Validation schema from scratch with fixed features name, type and presence. import tensorflow_data_validation as tfdv from tensorflow_metadata.proto.v0 import ...
Maxime Oriol's user avatar
-1 votes
1 answer
425 views

When I import tensorflow data validation using import tensorflow_data_validation as tfdv I get ModuleNotFoundError: No module named 'tensorflow_data_validation' When I want to install it using !pip ...
Waqar Ahmed's user avatar
0 votes
1 answer
400 views

I'm trying to install tensorflow_data_validation but it shows the below error. it says that i need Python version 2.7 or 3.4+ but I'm using the version 3.10.9. Also I have the 2.11.0 version of ...
Luis López's user avatar
3 votes
4 answers
4k views

["referring to the data shown in the image while using the filter function and getting the return values, How can I return blank cells as it is instead of printing as Zeros while using filter ...
user21810668's user avatar
0 votes
1 answer
101 views

a superimposed display for train/val splits using StatisticsGen Hi, I'm currently using tfx pipeline inside kubeflow. I struggle to have StatisticsGen showing a single graph with train and validation ...
Youcef Kacer's user avatar
-1 votes
1 answer
107 views

I tried to import and use the tensorflow_data_validation library, but I faced this error: ContextualVersionConflict: (google-api-core 1.25.1 (c:\users\amoosi\anaconda3\lib\site-packages), Requirement....
Ehsan MrH's user avatar
0 votes
2 answers
125 views

I am stucked in getting this block of statement.In my serving sets there are some anomalies. I am not getting what this below code is doing in removing the anomalies. payer_code = tfdv.get_feature(...
Taniya Sisodia's user avatar
0 votes
1 answer
257 views

"TypeError: statistics is of type StatsOptions, should be a DatasetFeatureStatisticsList proto." error shows when I am generating schema using tfdv.infer_schema() option but I am not able to ...
Mansi Mehta's user avatar
1 vote
1 answer
808 views

I am new to Joi. I try to put some rules to signup users. But every new user could signup and "if" command is always executed. const Joi = require ('joi') const authSchema = Joi.object()....
pusat_almdr's user avatar
0 votes
1 answer
181 views

I have problem with installing TensorFlow Data Validation (TFDV). I have already TensorFlow installed (version 2.5.0) and I don't want to upgrade that version. I used this command: pip install ...
Szymon Balawajder's user avatar
1 vote
1 answer
600 views

I have built a pipeline that read some data, does some manipulations and create some apache beam Row objects (Steps 1 and 2 in the code below). I then would like to generate statistic and write them ...
DarioB's user avatar
  • 1,649
0 votes
1 answer
384 views

I am following this "get started" tensorflow tutorial on how to run tfdv on apache beam on google cloud dataflow. My code is very similar to the one in the tutorial: import ...
DarioB's user avatar
  • 1,649
4 votes
1 answer
609 views

I want to infer a schema with tensorflow data validation (tfdv) based on a pandas dataframe of the training data. The dataframe contains a column with a multivalent feature, where multiple values (or ...
ppmt's user avatar
  • 177
0 votes
1 answer
543 views

Tensorflow Data Validation provides a way to find anomalies in your data. However, I am able to find only a way to provide a summarized version of the anomalies (by using tfdv.validate_statistics and ...
datapug's user avatar
  • 2,441
0 votes
0 answers
46 views

Has anyone been able to install TFDV on a Jetson AGX or similar device with AArch64 architecture? I have tried installing with PyPI, building with Docker, and building from source with no luck. I am ...
Gavin Hanley's user avatar
2 votes
4 answers
3k views

Since I'm moving away from pandas DataFrames to TensorFlow datasets, I'd like to use tensorflow-data-validation instead of the more traditional pandas-profiling when it comes to data exploration and ...
Tfovid's user avatar
  • 863
0 votes
1 answer
513 views

I follow the instruction of Fine-tuning BERT to build a model with my own dataset(It is kind of large, and greater than 20G), then take steps to re-cdoe my data and load them from tf_record files. ...
bruceyuan's user avatar
  • 561
1 vote
0 answers
103 views

i have a problem when generating stats over a small dataset (~10MB). it takes to much time and consume to much memory (it reaches 25 GB of my memory which makes no sense ). and at the end it stops ...
Amine_h's user avatar
  • 129
5 votes
1 answer
787 views

Question: Is it possible to run tensorflow-data-validation on MacOS with M1 chip? Steps taken: I have created a conda environment (tfdv38) in which I have installed the Mac-optimized TensorFlow. I ...
marie's user avatar
  • 51
1 vote
1 answer
452 views

I'm using TFX (more precisely TensorFlow Data Validation) with the infer_schema method documented there https://www.tensorflow.org/tfx/data_validation/api_docs/python/tfdv/infer_schema. It generates a ...
Pixou's user avatar
  • 1,789
1 vote
1 answer
417 views

I am presently trying to feed a dataset with a few multivalent feature columns through a TensorFlow Extended (TFX) pipeline. Here is a row from my sample data: user_id 29601 ...
Michael's user avatar
  • 353
0 votes
1 answer
82 views

i have a pandas dataframe of grouped values every hour 2020.05.05 17:00:00 cpu_usage_specint 4.354603 phys_iops 3075.779680 total_memory ...
higgytech's user avatar
2 votes
1 answer
359 views

Regarding TFX' tensorflow-data-validation, I'm trying to understand when I should use *Gen components vs. using TFDV provided methods. Specifically, what's confusing me is that I have this as my ...
Lorin S.'s user avatar
  • 760
0 votes
0 answers
276 views

I’m running into a problem of tensorflow-data-validation with direct runner to generate statistics from some large datasets over 400GB. It seems that all workers stopped working after an error message ...
junjian.xu's user avatar
2 votes
1 answer
876 views

Need help some in understanding the difference between the underlying implementation of skew and drift comparator in TFDV.
arnab_0017's user avatar
0 votes
0 answers
107 views

Running the model.fit(...) method on my dataset. I keep getting errors like this: ValueError: in user code: ... ValueError: Could not find matching function to call loaded from the SavedModel. Got: ...
ANTi7kZ's user avatar
  • 11
0 votes
0 answers
523 views

I have been trying to run TensorFlow Data Validation following Google Documents Followed same steps as https://www.tensorflow.org/tfx/data_validation/install: >pip install tensorflow-data-...
SNEH PANDYA's user avatar
1 vote
3 answers
1k views

I've been testing Tensorflow Data Validation (version 0.22.0) to use in my current ML pipelines and I noticed it does not get any anomaly in numerical features. For instance, > import pandas as ...
Anilton Cardoso Junior's user avatar
0 votes
4 answers
874 views

It is puzzling to me that there is a tfdv.load_statistics() function, but no corresponding tfdv.write_statistics() function. How do I go about saving the statistics, and then loading them again? e.g. ...
Stefan Krawczyk's user avatar
0 votes
1 answer
166 views

I'm using Tensorflow data validation and want to make sure a column has no duplicate values. But it seems Tensorflow's data validation doesn't have a feature like Deequ's isUnique function to check ...
Frank's user avatar
  • 1,285
1 vote
2 answers
2k views

I was trying to implement TensorFlow Data Validation to check drift/skew in a dataset. They are using the L-infinity norm as a metric. I didn't understand the concept. Can anyone explain how it is ...
Sairam Reddy's user avatar
1 vote
0 answers
181 views

I use anaconda and I have python 3.7, I tried to install tensorflow_data_validation from anaconda's prompt but it gives me the following error: Could not find a version that satisfies the requirement ...
Anonima19's user avatar
  • 121
1 vote
1 answer
328 views

I am using tensorflow data validation and I am trying to build schemas around my datasets. I've built the initial schemas and I can see/edit them in notepad, but I'm having a hard time actually ...
ecda_123's user avatar
1 vote
1 answer
3k views

I get this error with Python 3.7 and Windows 10 64-bit (which is supported). It seems that there are only wheels for Windows for Python 3.5 and 3.6... Thanks in advance. pip install tensorflow-data-...
Antonio's user avatar
  • 21
2 votes
2 answers
483 views

I want to enable numerical feature drift without setting a domain. Ex: I am scoring customers based on age and in my training set I had a uniform distribution of the variable. Now, in my serving data, ...
DataRevolution's user avatar
4 votes
1 answer
1k views

I would like to use TFMA with keras model. The keras model was created with TF 2.0 alpha. The model is a pretrained model with a classification layer: ...
eilalan's user avatar
  • 689
1 vote
2 answers
431 views

Is there any version of Tensorflow data validation (tfdv) that can work with tensorflow 2.0 alpha ?
eilalan's user avatar
  • 689
0 votes
1 answer
486 views

I looked at the TFX guide (https://www.tensorflow.org/tfx/guide), and I was able to run the pipeline. The StatisticsGen component wrote Dataset statistics to a file on disk. How do I visualize the ...
user8797249's user avatar
0 votes
1 answer
200 views

Following basic tutorial, and trying to generate stats for a TFRecord, I get a overflow error: import tensorflow_data_validation as tfdv stats = tfdv.generate_statistics_from_tfrecord(data_location='/...
user2934244's user avatar
0 votes
1 answer
358 views

I'm using Tensorflow Data Validation to generate stats from the data and infer an schema to input in TFX. I didn't find any option to specify the NaN values, e. g., in pandas there is a field "...
Fernando Silva's user avatar
1 vote
1 answer
711 views

I'm trying to get TFDV working with RGB images as feature inputs, reading from a TFRecords file. I can read/write the image data to TFRecord files fine. Here's the relevant code snippets for writing, ...
PaulH's user avatar
  • 31
1 vote
1 answer
110 views

I am trying to run tensor flow data validation feature on data sets (CSV) > 2 Gig. It is crashing after some time. It runs very well if data set is around 1 Gig. How to handle large data sets without ...
test 9539ba's user avatar
5 votes
3 answers
2k views

TFDV generates schema as a Schema protocol buffer. However it seems that there is no helper function to write/read schema to/from a file. schema = tfdv.infer_schema(stats) How can I save it/load it ?
Vincent Teyssier's user avatar
1 vote
1 answer
196 views

I am not able to install tensorflow-data-validation using pip. when I use: pip install tensorflow-data-validation I get this error: Could not find a version that satisfies the requirement ...
Deepak B's user avatar