Practical Uses and Methods for Synthetic Data

Synthetic data is useful for many data science tasks and can even improve security, since it can stand in for real data that is too sensitive to share. Join us for Free Code Fridays, where Ted Dunning, Chief Application Architect at MapR, will use log-synth, an open-source program, to generate interesting randomized data. Watch this 30-minute demo to see how you can use log-synth to:

  • Make up names and addresses or sample from realistically perverse numerical distributions
  • Build data sets that can join cleanly but have long-tailed frequency distributions
  • Build fairly realistic session histories
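
To make the second point concrete, here is a minimal sketch in plain Python of two tables that join cleanly while one of them has a long-tailed frequency distribution. It does not use log-synth or its schema format. The table layout, the name lists, the function names and the `skew` parameter are all assumptions made for illustration.

```python
import random
from collections import Counter

# Made-up sample values; a real generator would draw from much larger lists.
FIRST_NAMES = ["Ana", "Ben", "Chen", "Dee", "Eli", "Fay"]
LAST_NAMES = ["Ito", "Khan", "Lopez", "Moss", "Ng", "Owen"]


def make_users(n, rng):
    """Return n user records with sequential ids and made-up names."""
    return [
        {"user_id": i, "name": f"{rng.choice(FIRST_NAMES)} {rng.choice(LAST_NAMES)}"}
        for i in range(n)
    ]


def make_events(users, n_events, rng, skew=1.2):
    """Return events whose user_id values follow a Zipf-like, long-tailed distribution."""
    ids = [u["user_id"] for u in users]
    # The weight for rank r is 1 / r**skew, so a few users get most of the events.
    weights = [1.0 / (rank ** skew) for rank in range(1, len(ids) + 1)]
    chosen = rng.choices(ids, weights=weights, k=n_events)
    return [{"event_id": i, "user_id": uid} for i, uid in enumerate(chosen)]


rng = random.Random(42)  # fixed seed so the output is repeatable
users = make_users(100, rng)
events = make_events(users, 5000, rng)

# Every event joins to exactly one user because ids are drawn from the user table.
user_ids = {u["user_id"] for u in users}
orphans = [e for e in events if e["user_id"] not in user_ids]
counts = Counter(e["user_id"] for e in events)

print("orphan events:", len(orphans))
print("top 5 users by event count:", counts.most_common(5))
```

Drawing the foreign keys from the user table itself is what guarantees the clean join, and the rank-based weights are what produce the long tail: a handful of users account for most events while many users appear only a few times.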