Fix with_regex_replacement_table mutator not behaving correctly when pattern only matches partly
0.6.3 (Dec 4, 2024)
Features
Improve randomized selection of replacements in mutators using replacement tables
Fixes
Fix phonetic replacement rules not being matched correctly against original data when the desired pattern occurs in multiple places
0.6.2 (Nov 27, 2024)
Features
Add placeholder option to with_generator for inserting generated values
0.6.1 (Nov 15, 2024)
Fixes
Fix indexing behavior in dfbitlookup when using NumPy data types
0.6.0 (Nov 15, 2024)
Breaking changes
Change mutator type definition from Callable[[list[pd.Series]], list[pd.Series]] to Callable[[list[pd.Series], Optional[float]], list[pd.Series]] to delegate the selection of rows to mutate to the mutators themselves
Generator and Mutator type definitions are now exported at the top level of the module
Replace D option in favor of d for unit parameter in with_datetime_offset
Remove strategy parameter from with_missing_value
Remove rng parameter from mutate_data_frame
Remove with_edit in favor of with_group
Features
with_replacement_table, with_regex_replacement_table and with_phonetic_replacement_table now favor rare replacements over common ones
Add rng parameter to with_function, with_lowercase, with_missing_value, with_noop, with_repeat, with_uppercase
with_permute now permutes series contents in a way that values are guaranteed to not remain in their original series
Add days, hours, minutes and seconds to list of permitted unit values for with_datetime_offset
Add list[str] as option to charset parameter of with_cldr_keymap_file, with_insert and with_substitute
Fixes
When providing a list of mutators to a column in mutate_data_frame, all mutators are now applied to all rows instead of with a 1 / mutator_count probability
Fix with_regex_replacement_table interpreting numbers in pattern and substitution columns as belonging to a named capture group
Documentation
Use section-style navigation instead of tabs in Gecko docs
0.5.2 (Nov 5, 2024)
Features
Add generator.with_group for grouping multiple (weighted) generators together
Internal
Remove automated benchmarks
0.5.1 (Oct 30, 2024)
Features
Add the option to use data frames for all generators and mutators that accept paths to CSV files
0.5.0 (Oct 23, 2024)
Breaking changes
to_data_frame has a new call signature that ensures that it's consistent with mutate_data_frame