46 questions
0
votes
1
answer
242
views
Tensorflow-data-validation installing issue
I'm using Python 3.12, and I'm trying to install the latest version (1.15.1) of TensorFlow Data Validation. However, while installing, I encountered the following error:
ERROR: Could not find a ...
0
votes
1
answer
99
views
By excel vba adding checkmarks to each options selected in a multi-select datavalidation dropdown list
Using the following Excel VBA code, which I found on the internet, I can select more than one option within a cell from the DV dropdown list at the same time. What I am trying to do is to add a checkmark ...
0
votes
1
answer
249
views
How to define features presence in a TensorFlow Data Validation schema?
I want to create a new TensorFlow Data Validation schema from scratch with fixed feature names, types and presence.
import tensorflow_data_validation as tfdv
from tensorflow_metadata.proto.v0 import ...
-1
votes
1
answer
425
views
Problem of tensorflow data validation on google colab
When I import tensorflow data validation using
import tensorflow_data_validation as tfdv
I get
ModuleNotFoundError: No module named 'tensorflow_data_validation'
When I want to install it using
!pip ...
0
votes
1
answer
400
views
Installing tensorflow_data_validation is causing error #tensorflow_data_validation
I'm trying to install tensorflow_data_validation but it shows the below error.
It says that I need Python version 2.7 or 3.4+, but I'm using version 3.10.9.
Also I have the 2.11.0 version of ...
3
votes
4
answers
4k
views
How can I return blank cells as it is instead of printing as Zeros while using filter function
Referring to the data shown in the image: while using the filter function and getting the return values, how can I return blank cells as they are instead of printing them as zeros while using filter ...
0
votes
1
answer
101
views
tfx.components.StatisticsGen display train and eval in two different figures, is it possible to have them in a single figure as tfdv does?
a superimposed display for train/val splits using StatisticsGen
Hi,
I'm currently using a TFX pipeline inside Kubeflow. I'm struggling to get StatisticsGen to show a single graph with train and validation ...
-1
votes
1
answer
107
views
Problem in importing tensorflow_data_validation
I tried to import and use the tensorflow_data_validation library, but I faced this error:
ContextualVersionConflict: (google-api-core 1.25.1 (c:\users\amoosi\anaconda3\lib\site-packages), Requirement....
0
votes
2
answers
125
views
relaxing the anomaly detection constraints in MLOps
I am stuck trying to understand this block of statements. In my serving sets there are some anomalies.
I don't understand what the code below does to remove the anomalies.
payer_code = tfdv.get_feature(...
0
votes
1
answer
257
views
Why I am not able to generate schema using tfdv.infer_schema()?
"TypeError: statistics is of type StatsOptions, should be a DatasetFeatureStatisticsList proto." error shows when I am generating schema using tfdv.infer_schema() option but I am not able to ...
1
vote
1
answer
808
views
create a data validation using JOI module
I am new to Joi. I am trying to add some rules for signing up users, but every new user can sign up and the "if" statement is always executed.
const Joi = require ('joi')
const authSchema = Joi.object()....
0
votes
1
answer
181
views
Is it possible to install specific version of TFDV without upgrading TF?
I have a problem installing TensorFlow Data Validation (TFDV).
I already have TensorFlow installed (version 2.5.0) and I don't want to upgrade that version.
I used this command:
pip install ...
1
vote
1
answer
600
views
apache beam rows to tfrecord in order to GenerateStatistics
I have built a pipeline that reads some data, does some manipulations and creates some Apache Beam Row objects (Steps 1 and 2 in the code below). I then would like to generate statistics and write them ...
0
votes
1
answer
384
views
tensorflow data validation tfdv fails on google cloud dataflow with "Can't get attribute 'NumExamplesStatsGenerator' "
I am following this "get started" tensorflow tutorial on how to run tfdv on apache beam on google cloud dataflow. My code is very similar to the one in the tutorial:
import ...
4
votes
1
answer
609
views
Infer multivalent features with tfdv from pandas dataframe
I want to infer a schema with tensorflow data validation (tfdv) based on a pandas dataframe of the training data. The dataframe contains a column with a multivalent feature, where multiple values (or ...
0
votes
1
answer
543
views
TensorFlow Data Validation - How to return the rows with anomalies
Tensorflow Data Validation provides a way to find anomalies in your data.
However, I am able to find only a way to provide a summarized version of the anomalies (by using tfdv.validate_statistics and ...
0
votes
0
answers
46
views
Installing TensorFlow Data Validation on Jetson AGX (AArch64)
Has anyone been able to install TFDV on a Jetson AGX or similar device with AArch64 architecture? I have tried installing with PyPI, building with Docker, and building from source with no luck. I am ...
2
votes
4
answers
3k
views
tensorflow-data-validation cannot be pip installed
Since I'm moving away from pandas DataFrames to TensorFlow datasets, I'd like to use tensorflow-data-validation instead of the more traditional pandas-profiling when it comes to data exploration and ...
0
votes
1
answer
513
views
Dimension does not match when using `keras.Model.fit` in `BERT` of tensorflow
I followed the instructions in Fine-tuning BERT to build a model with my own dataset (it is fairly large, greater than 20 GB), then took steps to re-code my data and load it from tf_record files.
...
1
vote
0
answers
103
views
TFDV generate_stats_from_csv trigger an out of memory error
I have a problem when generating stats over a small dataset (~10MB).
It takes too much time and consumes too much memory (it reaches 25 GB of my memory, which makes no sense), and at the end it stops ...
5
votes
1
answer
787
views
Is it possible to run tensorflow-data-validation on MacOS with M1 chip?
Question: Is it possible to run tensorflow-data-validation on MacOS with M1 chip?
Steps taken: I have created a conda environment (tfdv38) in which I have installed the Mac-optimized TensorFlow.
I ...
1
vote
1
answer
452
views
In TFX, is it possible to infer Schema with dates?
I'm using TFX (more precisely TensorFlow Data Validation) with the infer_schema method documented there https://www.tensorflow.org/tfx/data_validation/api_docs/python/tfdv/infer_schema. It generates a ...
1
vote
1
answer
417
views
TensorFlow Extended: Specifying Valency Of Features In Schema
I am presently trying to feed a dataset with a few multivalent feature columns through a TensorFlow Extended (TFX) pipeline. Here is a row from my sample data:
user_id 29601
...
0
votes
1
answer
82
views
python pandas multi-level index getting a particular value
I have a pandas DataFrame of values grouped by hour
2020.05.05 17:00:00 cpu_usage_specint 4.354603
phys_iops 3075.779680
total_memory ...
2
votes
1
answer
359
views
Why isn't SchemaGen supported in tfdv.display_schema()?
Regarding TFX's tensorflow-data-validation, I'm trying to understand when I should use *Gen components vs. using TFDV provided methods.
Specifically, what's confusing me is that I have this as my ...
0
votes
0
answers
276
views
tensorflow-data-validation doesn't work on large datasets with apache-beam direct runner because of grpc timeout
I’m running into a problem with tensorflow-data-validation when using the direct runner to generate statistics from some large datasets over 400GB.
It seems that all workers stopped working after an error message ...
2
votes
1
answer
876
views
What is the difference between implementation of 'drift_comparator' and 'skew_comparator' available in Tensorflow Data Validation?
I need some help understanding the difference between the underlying implementations of the skew and drift comparators in TFDV.
0
votes
0
answers
107
views
Keep getting ValueError while training on tf.data.datasets and include validation_data
Running the model.fit(...) method on my dataset. I keep getting errors like this:
ValueError: in user code:
...
ValueError: Could not find matching function to call loaded from the SavedModel. Got:
...
0
votes
0
answers
523
views
Unable to run Tensorflow Data Validation on Google Cloud Platform (Dataflow)
I have been trying to run TensorFlow Data Validation following Google's documentation.
I followed the same steps as https://www.tensorflow.org/tfx/data_validation/install:
>pip install tensorflow-data-...
1
vote
3
answers
1k
views
Tensorflow Data Validation does not identify anomalies in numerical features
I've been testing Tensorflow Data Validation (version 0.22.0) to use in my current ML pipelines and I noticed it does not detect any anomalies in numerical features. For instance,
> import pandas as ...
0
votes
4
answers
874
views
How do I save a TFDV stats in the correct format for them to be loaded back in?
It is puzzling to me that there is a tfdv.load_statistics() function, but no corresponding tfdv.write_statistics() function. How do I go about saving the statistics, and then loading them again?
e.g.
...
0
votes
1
answer
166
views
Tensorflow data validation for uniqueness
I'm using Tensorflow data validation and want to make sure a column has no duplicate values. But it seems Tensorflow's data validation doesn't have a feature like Deequ's isUnique function to check ...
1
vote
2
answers
2k
views
Understanding L-infinity norm which is used in TFDV
I was trying to implement TensorFlow Data Validation to check drift/skew in a dataset. They are using the L-infinity norm as a metric. I didn't understand the concept. Can anyone explain how it is ...
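As a rough illustration of the metric this question asks about: the L-infinity distance between two distributions is the largest absolute difference between their corresponding entries. The sketch below is plain Python and does not call TFDV; the function names, the color categories, and the counts are all made up for illustration, and it assumes the comparison is between normalized value frequencies of a categorical feature.

```python
def normalize(counts):
    """Turn raw value counts into relative frequencies that sum to 1."""
    total = sum(counts.values())
    return {value: count / total for value, count in counts.items()}


def l_infinity_distance(p, q):
    """Largest absolute difference between two frequency maps.

    A value missing from one map is treated as having frequency 0.
    """
    values = set(p) | set(q)
    return max(abs(p.get(v, 0.0) - q.get(v, 0.0)) for v in values)


# Hypothetical value counts for one categorical feature in two datasets.
train = normalize({"red": 50, "green": 30, "blue": 20})
serving = normalize({"red": 20, "green": 30, "blue": 50})

distance = l_infinity_distance(train, serving)
print(distance)
```

Under these assumptions, a drift or skew check would compare `distance` against a chosen threshold and flag the feature when the threshold is exceeded.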
1
vote
0
answers
181
views
How can I install tensorflow_data_validation in python 3.7?
I use Anaconda and I have Python 3.7. I tried to install tensorflow_data_validation from the Anaconda prompt, but it gives me the following error: Could not find a version that satisfies the requirement ...
1
vote
1
answer
328
views
How to see all the possible options for schema metadata in tensorflow?
I am using tensorflow data validation and I am trying to build schemas around my datasets. I've built the initial schemas and I can see/edit them in notepad, but I'm having a hard time actually ...
1
vote
1
answer
3k
views
ERROR: Could not find a version that satisfies the requirement tensorflow-data-validation (from versions: none)
I get this error with Python 3.7 and Windows 10 64-bit (which is supported).
It seems that there are only wheels for Windows for Python 3.5 and 3.6...
Thanks in advance.
pip install tensorflow-data-...
2
votes
2
answers
483
views
How to enable numerical feature drift?
I want to enable numerical feature drift without setting a domain. Ex: I am scoring customers based on age and in my training set I had a uniform distribution of the variable. Now, in my serving data, ...
4
votes
1
answer
1k
views
Tensorflow model analysis, TFMA for keras model
I would like to use TFMA with a Keras model. The Keras model was created with TF 2.0 alpha.
The model is a pretrained model with a classification layer:
...
1
vote
2
answers
431
views
Tensorflow data validation for tensorflow 2.0
Is there any version of Tensorflow data validation (tfdv) that can work with tensorflow 2.0 alpha?
0
votes
1
answer
486
views
Reading Dataset statistics generated by StatisticsGen
I looked at the TFX guide (https://www.tensorflow.org/tfx/guide), and I was able to run the pipeline. The StatisticsGen component wrote Dataset statistics to a file on disk. How do I visualize the ...
0
votes
1
answer
200
views
TFX / Tensorflow Data Validation (TFDV) throws overflow encountered in float_scalars error
Following the basic tutorial and trying to generate stats for a TFRecord, I get an overflow error:
import tensorflow_data_validation as tfdv
stats = tfdv.generate_statistics_from_tfrecord(data_location='/...
0
votes
1
answer
358
views
Tensorflow TFDV does not work with Specific NaN values
I'm using Tensorflow Data Validation to generate stats from the data and infer a schema to input in TFX.
I didn't find any option to specify the NaN values, e.g., in pandas there is a field "...
1
vote
1
answer
711
views
Tensorflow TFDV does not work with images
I'm trying to get TFDV working with RGB images as feature inputs, reading from a TFRecords file. I can read/write the image data to TFRecord files fine. Here are the relevant code snippets for writing, ...
1
vote
1
answer
110
views
tensorflow data validation is crashing on 4 cores machine when I process more than 1.5 Gig CSV
I am trying to run TensorFlow Data Validation on data sets (CSV) > 2 GB. It crashes after some time. It runs very well if the data set is around 1 GB. How to handle large data sets without ...
5
votes
3
answers
2k
views
TFDV Tensorflow Data Validation: how can I save/load the protobuf schema to/from a file
TFDV generates the schema as a Schema protocol buffer. However, it seems that there is no helper function to write/read the schema to/from a file.
schema = tfdv.infer_schema(stats)
How can I save it/load it?
1
vote
1
answer
196
views
pip install tensorflow-data-validation fails on OS X El Capitan (10.11.6)
I am not able to install tensorflow-data-validation using pip. When I use:
pip install tensorflow-data-validation
I get this error:
Could not find a version that satisfies the requirement
...