-
Updated
Sep 11, 2021 - TypeScript
#
data-generation
Here are 112 public repositories matching this topic...
Random data generator.
testing
json
data
rest-api
random
randomization
random-generation
courtesy
human-data
data-generation
test-data
data-generator
test-data-generator
data-generators
A list of useful data augmentation resources. Here you will find some uncommon techniques, libraries, and links to GitHub repos, papers, and other material.
review
machine-learning
survey
generative-adversarial-network
style-transfer
data-generation
data-augmentation
data-synthesis
autoaugment
data-augmentations
augmentation-policies
-
Updated
Sep 1, 2021
Data generation and property-based testing for Elixir. 🔮
-
Updated
Jul 9, 2021 - Elixir
llogiq commented Sep 14, 2021
Required Functionality
Currently, number types can only be range, categorical (for integers) and constant (some can also be Id). There is no easy way to specify generating arbitrary numbers.
Proposed Solution
Consider adding an unbounded variant that will generate any valid number of the given type. Alternatively, make serde use the range default if no argument at all is given.
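The proposal above can be sketched roughly as follows. This is a minimal illustration in Python, not the project's actual schema or implementation: the function name generate_number, the spec dictionary layout, and the keys kind, bits, signed, weights, low and high are all hypothetical. It shows an unbounded variant sitting next to constant, categorical and range variants, drawing uniformly from every value a fixed-width integer type can hold.

```python
import random


def generate_number(spec, rng):
    """Generate one integer from a hypothetical number spec dictionary."""
    kind = spec["kind"]
    if kind == "constant":
        # Always return the same configured value.
        return spec["value"]
    if kind == "categorical":
        # Weighted choice among an explicit set of integer values.
        values = list(spec["weights"].keys())
        weights = list(spec["weights"].values())
        return rng.choices(values, weights=weights, k=1)[0]
    if kind == "range":
        # Half-open interval [low, high), as in Python's randrange.
        return rng.randrange(spec["low"], spec["high"])
    if kind == "unbounded":
        # Assumed semantics: any value representable in `bits` bits.
        bits = spec["bits"]
        raw = rng.getrandbits(bits)
        if spec.get("signed", False) and raw >= 1 << (bits - 1):
            # Reinterpret the upper half as negative (two's complement).
            raw -= 1 << bits
        return raw
    raise ValueError(f"unknown number kind: {kind!r}")


if __name__ == "__main__":
    rng = random.Random()
    # Any signed 32-bit integer, with no bounds supplied by the caller.
    print(generate_number({"kind": "unbounded", "bits": 32, "signed": True}, rng))
```

Under these assumptions the caller only names the integer width and signedness; the bounds follow from the type, which is the convenience the issue asks for.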
Generate strings that match a given regular expression
-
Updated
Mar 23, 2021 - Ruby
Synthetic Data Generation for tabular, relational and time series data.
machine-learning
time-series
generative-adversarial-network
gan
data-generation
gans
synthetic-data
sdv
multi-table
synthetic-data-generation
relational-datasets
-
Updated
Sep 13, 2021 - Jupyter Notebook
Conditional GAN for generating synthetic tabular data.
tabular-data
generative-adversarial-network
data-generation
synthetic-data
synthetic-data-generation
-
Updated
Sep 14, 2021 - Python
MockNeat - the modern faker lib.
java
csv
big-data
randomization
faker
mocking
random-generation
data-generation
java-8
random-number-generators
lorem-ipsum
data-generator
faker-library
fake-data
faker-generator
randomizer
sample-data
sql-insert
arbitrary-data
sample-data-generator
-
Updated
Jul 26, 2021 - Java
Deep Convolutional Neural Networks for Musical Source Separation
theano
deep-learning
signal-processing
data-generation
convolutional-neural-networks
audio-synthesis
data-augmentation
source-separation
sample-querying
score-synthesis
-
Updated
Jan 31, 2020 - Python
A library to model multivariate data using copulas.
-
Updated
Sep 10, 2021 - Jupyter Notebook
Random dataframe and database table generator
python
data-science
database
generator
sqlite
pandas-dataframe
random-generation
data-generation
sqlite3
fake-data
synthetic-data
synthetic-dataset-generation
-
Updated
Jun 9, 2021 - Python
Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and text alignment capabilities.
python
data-science
machine-learning
synthetic-images
data-generation
ner
ocr-recognition
text-alignment
synthetic-data
synthetic-data-generation
-
Updated
Aug 18, 2021 - Jupyter Notebook
The DataHelix generator allows you to quickly create data, based on a JSON profile that defines fields and the relationships between them, for the purpose of testing and validation
-
Updated
Jun 28, 2021 - Java
Custom image data generator for Keras supporting the use of modern augmentation modules
python
machine-learning
deep-learning
data-generation
image-classification
image-augmentation
augmentation
tensorflow2
augmentations
-
Updated
Sep 6, 2021 - Python
Ranger is a contextual data generator used to make sensible data for integration tests or to play with in the database
-
Updated
May 22, 2020 - Java
GRATIS: GeneRAting TIme Series with diverse and controllable characteristics
-
Updated
Sep 14, 2021 - R
Benerator is a leading software solution to generate, obfuscate, pseudonymize and migrate data for development, testing and training purposes.
java
obfuscate
migration
data-generation
performance-testing
testdata
anonymization
benerator
data-masking
data-modelling
databene
-
Updated
Sep 13, 2021 - Java
Neuralyzer is a library and a command line tool to anonymize databases (by updating existing data or populating a table with fake data)
database
data-generation
data-generator
anonymisation
data-privacy
anonymize
anonymization
rgpd
dgpr
private-life
-
Updated
Sep 10, 2021 - PHP
simstudy: Illuminating research methods through data generation
-
Updated
Aug 21, 2021 - R
NoiseMix - data generation for natural language
-
Updated
May 26, 2018 - Python
Just a small open-source script to create fake data given a simple JSON model.
open-source
npm
node
script
faker
data-generation
data-generator
hacktoberfest
fake-data
fixture-generator
-
Updated
Aug 10, 2021 - JavaScript
Generate relevant data quickly for your projects. The Databricks data generator can be used to generate large simulated / synthetic data sets for test, POCs, and other uses
-
Updated
Sep 13, 2021 - Python
Example repository showing how to utilise k6 and faker to load test using generated data
-
Updated
Aug 12, 2021 - JavaScript
Synthetic Data Generation for mixed-type, multivariate time series.
deep-learning
time-series
generative-adversarial-network
data-generation
synthetic-data
sdv
synthetic-data-generation
-
Updated
Sep 8, 2021 - Python
Using synthetic data in combination with deep learning to determine whether a system can be built that correctly recognises and classifies real traffic signs.
machine-learning
computer-vision
deep-learning
keras
image-processing
synthetic-images
data-generation
image-manipulation
convolutional-neural-networks
automated
keras-neural-networks
traffic-sign-classification
synthetic-data
driver-assistant
-
Updated
May 28, 2018 - Jupyter Notebook
Star Schema Benchmark data set generator (dbgen) - unified repository
-
Updated
Dec 29, 2020 - C
A browser extension that fills registration forms with randomly but consistently generated fake data.
-
Updated
Sep 8, 2021 - JavaScript
As seen from #53, we can have broken/dead links: links that once worked can become unavailable for reasons outside the control of this project/repo.
Hence I have decided to manually scan the repo (for now) from time to time for such links and fix any that turn up. Here are the steps to take:
New broken/dead links
markdown-link-check
(see https://www.npmjs.c