开源软件名称(OpenSource Name):PAIR-code/facets开源软件地址(OpenSource Url):https://github.com/PAIR-code/facets开源编程语言(OpenSource Language):Jupyter Notebook 62.6%开源软件介绍(OpenSource Introduction):IntroductionThe facets project contains two visualizations for understanding and analyzing machine learning datasets: Facets Overview and Facets Dive. The visualizations are implemented as Polymer web components, backed by Typescript code and can be easily embedded into Jupyter notebooks or webpages. Live demos of the visualizations can be found on the Facets project description page. Facets OverviewOverview gives a high-level view of one or more data sets. It produces a visual feature-by-feature statistical analysis, and can also be used to compare statistics across two or more data sets. The tool can process both numeric and string features, including multiple instances of a number or string per feature. Overview can help uncover issues with datasets, including the following:
Key aspects of the visualization are outlier detection and distribution comparison across multiple datasets. Interesting values (such as a high proportion of missing data, or very different distributions of a feature across multiple datasets) are highlighted in red. Features can be sorted by values of interest such as the number of missing values or the skew between the different datasets. The python code to generate the statistics for visualization can be installed through Details about Overview usage can be found in its README. Facets DiveDive is a tool for interactively exploring up to tens of thousands of multidimensional data points, allowing users to seamlessly switch between a high-level overview and low-level details. Each example is a represented as single item in the visualization and the points can be positioned by faceting/bucketing in multiple dimensions by their feature values. Combining smooth animation and zooming with faceting and filtering, Dive makes it easy to spot patterns and outliers in complex data sets. Details about Dive usage can be found in its README. SetupUsage in Google Colabratory/Jupyter NotebooksUsing Facets in Google Colabratory and Jupyter notebooks can be seen in this notebook. These notebooks work without the need to first download/install this repository. Both Facets visualizations make use of HTML imports. So in order to use them, you must first load the appropriate polyfill, through Note that for using Facets Overview in a Jupyter notebook, there are two considerations:
When visualizing a large amount of data in Dive in a Juypter notebook, as is done in the Dive demo Jupyter notebook, you will need to start the notebook server with an increased IOPub data rate.
This can be done with the command Code Installation
Building the VisualizationsIf you make code changes to the visualization and would like to rebuild them, follow these directions:
Using the rebuilt Visualizations in a Jupyter notebookIf you want to use the visualizations you built locally in a Jupyter notebook, follow these directions:
Known Issues
Disclaimer: This is not an official Google product |
2023-10-27
2022-08-15
2022-08-17
2022-09-23
2022-08-13
请发表评论