Commit 7880528
authored
* WIP: Creating complex example
* WIP: Creating complex example
* Minor linter fixes
* Added synthetic data
* Trying to get test pipeline to work: works with decision tree classifier, but not keras nn
* Example works without w2v
* Complex example runs. TODO: mlinspect support for it
* Removed unnecessary statement
* Commented out stuff in complex example to make pipeline run
* Added join Dag Node
* Fixed a bug in code_reference_to_module extraction
* Fixed a bug involving chained method calls
* Groupby Dag nodes work in progress
* Added end_lineno and end_col_offset to CodeReference to support chained function call edge cases
* WIP: fixing tests after code_reference change
* WIP: fixing tests after code_reference change
* WIP: fixing tests after code_reference change
* WIP: fixing tests after code_reference change
* All tests work again after modifying code_reference
* Groupby Aggregate Dag node with description
* Discovered a huge bug with subscript instrumentation
* Resulting changes from instrumentation bug fix
* Dag Extraction works for projections with lists of columns as argument
* Found way how subscript assigns might be possible
* Index-Assingn wir extraction works
* Index-Assign runs, but no dag extraction yet. Todo: add module info
* Index-Assign is doable but requires other operators to work
* Added some comments
* Select does not work yet but does not throw errors anymore
* Train-Test-Split no longer causes mlinspect to fail, still need to implement wir_extraction for tuple unpacking
* Added WIR support for tuple unpacking in assignments
* Imputer now in Dag
* Nested pipeline DAG creation starting to work. TODO: Delete left-over original nodes after copying for column transformer
* Nested pipeline DAG extraction works
* Added support for W2V transformer
* Dag Extraction works for whole complex pipeline, only some runtime stuff is missing that requires analyzer instrumentation
* Fixed a test
* Subscript Assign Dag nodes work completely now
* Projection/selection differentiation for df.__getitem__ almost works
* Select Dag node works now
* Finished selection changes and updated tests
* Added a TODO
* Adding analyzer support statement by statement: Data Source
* WIP: join
* WIP: join
* Pandas Backend now supports Joins
* Pandas Backend now supports Group by Aggs by treating it as a data source
* Pandas Backend now supports the 2nd join (found bug in inital join implementation)
* Preparing set label
* Pandas Backend now supports set label syntax
* Fixed bug introduced in last commit
* Projection with double list syntax already works
* Select by series works
* Train-test-splits work, although we'll need to revisit them once we model the test set in the DAG
* Started with sklearn pipeline
* Analyzer support for simple version of the sklearn pipeline
* Analyzer support for W2V transformer
* Analyzers work for complex example. Some code will need to be cleaned up though. TODO: Write demo analyzers
* Moved demo into nb and new directory
* Fixed bug with print score
* Some cleanup. For some reason, due to moving the demo_utils, the healthcare example is slower
* More cleanup
* More cleanup
* Started with Demo Analyzer. Works: Propagating age_group. TODO: Histograms, Embeddings
* Propagating age_group and race to calculate histograms
* First histogram plots work
* Histograms for Data Source and Groupby Agg if col available
* Added matplotlib to dependencies
* Included race histograms
* Add missing embedding inspection
* Update demo notebook, added simple pipeline time measurement. Note: iterators are not shared yet between inspections, so each inspection will slow everything down a bit
* Added a simple lineage inspection for demo purposes
* Renamed analyzers to inspections
* Improved package structure a bit
* Some cleanup
* More cleanup
* Changes from cidr version of repo
* Disable demo notebook image output when running corresponding pytest test
* Update readme after renaming analyzers to inspections
1 parent f9adae0 commit 7880528
File tree
61 files changed
+5181
-1449
lines changed- demo
- healthcare
- mlinspect
- backends
- inspections
- instrumentation
- analyzers
- backends
- requirements
- test
- backends
- demo
- inspections
- instrumentation
- backends
- pipelines
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
61 files changed
+5181
-1449
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
36 | 36 | | |
37 | 37 | | |
38 | 38 | | |
39 | | - | |
| 39 | + | |
40 | 40 | | |
41 | 41 | | |
42 | 42 | | |
43 | 43 | | |
44 | 44 | | |
45 | | - | |
| 45 | + | |
46 | 46 | | |
47 | 47 | | |
48 | 48 | | |
49 | | - | |
| 49 | + | |
50 | 50 | | |
51 | 51 | | |
52 | 52 | | |
| |||
File renamed without changes.
File renamed without changes.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
Loading
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
| 48 | + | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
Large diffs are not rendered by default.
0 commit comments