Package: daqapo 0.3.1

Niels Martin

daqapo: Data Quality Assessment for Process-Oriented Data

Provides a variety of methods to identify data quality issues in process-oriented data, which are useful to verify data quality in a process mining context. Builds on the class for activity logs implemented in the package 'bupaR'. Methods to identify data quality issues either consider each activity log entry independently (e.g. missing values, activity duration outliers,...), or focus on the relation amongst several activity log entries (e.g. batch registrations, violations of the expected activity order,...).

Authors:Niels Martin [aut, cre], Greg Van Houdt [ctb], Gert Janssenswillen [ctb]

daqapo_0.3.1.tar.gz


daqapo_0.3.1.tar.gz(r-4.4-noble)
daqapo_0.3.1.tgz(r-4.4-emscripten)daqapo_0.3.1.tgz(r-4.3-emscripten)
daqapo.pdf |daqapo.html
daqapo/json (API)

# Install 'daqapo' in R:
install.packages('daqapo', repos = c('https://bupaverse.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Bug tracker:https://github.com/nielsmartin/daqapo/issues

Datasets:

On CRAN:

46 exports 6 stars 1.33 score 83 dependencies 7 scripts 249 downloads

Last updated 2 years agofrom:2c90310c59. Checks:ERROR: 1. Indexed: yes.

TargetResultDate
Doc / VignettesFAILSep 18 2024

Exports:%>%activitylogassign_instance_idconvert_timestampsdetect_activity_frequency_violationsdetect_activity_order_violationsdetect_attribute_dependenciesdetect_case_id_sequence_gapsdetect_conditional_activity_presencedetect_duration_outliersdetect_inactive_periodsdetect_incomplete_casesdetect_incorrect_activity_namesdetect_missing_valuesdetect_multiregistrationdetect_overlapsdetect_related_activitiesdetect_resource_inconsistenciesdetect_similar_labelsdetect_time_anomaliesdetect_unique_valuesdetect_value_range_violationsdmydmy_hdmy_hmdmy_hmsdomain_categoricaldomain_numericdomain_timeduration_withinevents_to_activitylogfilter_anomaliesfixfix_resource_inconsistenciesmdymdy_hmdy_hmmdy_hmsread_csvread_csv2renamestandardize_lifecycleymdymd_hymd_hmymd_hms

Dependencies:base64encbitbit64bslibbupaRcachemclicliprcolorspacecommonmarkcpp11crayondata.tabledigestdplyredeaReventdataRfansifarverfastmapfontawesomeforcatsfsgenericsggplot2ggthemesgluegtablehmshtmltoolshttpuvisobandjquerylibjsonlitelabelinglaterlatticelifecyclelubridatemagrittrMASSMatrixmemoisemgcvmimeminiUImunsellnlmepillarpkgconfigprettyunitsprogresspromisespurrrR6rappdirsRColorBrewerRcppreadrrlangsassscalesshinyshinyTimesourcetoolsstringdiststringistringrtibbletidyrtidyselecttimechangetzdbutf8vctrsviridisLitevroomwithrxesreadRXMLxml2xtablezoo

Readme and manuals

Help Manual

Help pageTopics
daqapo - Data Quality Assessment for Process-oriented Datadaqapo
Check activity frequenciesdetect_activity_frequency_violations
Detect activity order violationsdetect_activity_order_violations detect_activity_order_violations.activitylog
Detect dependency violations between attributesdetect_attribute_dependencies
Detect gaps in case_iddetect_case_id_sequence_gaps
Detect conditional activity presence violationsdetect_conditional_activity_presence
Detect activity duration outliersdetect_duration_outliers
Detect inactive periodsdetect_inactive_periods
Detect incomplete casesdetect_incomplete_cases
Detect incorrect activity namesdetect_incorrect_activity_names
Detect missing valuesdetect_missing_values
Detect multi-registrationdetect_multiregistration
Detect overlapping acitivity instancesdetect_overlaps
Detect missing related activitiesdetect_related_activities
Search for similar labels in a columndetect_similar_labels
Detect time anomaliesdetect_time_anomalies
Search for unique values / distinct combinationsdetect_unique_values
Detect value range violationsdetect_value_range_violations
Define allowable range of valuesdomain_categorical
Define allowable range of valuesdomain_numeric
Define allowable time rangedomain_time
Define bounds for activity durationduration_within
Filter anomalies from the activity logfilter_anomalies
Fix problemsfix
An activity log of 20 patients in a hospital (data frame)hospital
An activity log of 20 patients in a hospital (activity log object)hospital_actlog
An event log of 20 patients in a hospitalhospital_events