How does data cleaning work?
  • 21 Apr 2022
  • 1 Minute to read
  • Contributors
  • Dark
    Light
  • PDF

How does data cleaning work?

  • Dark
    Light
  • PDF

Article Summary

There's nothing more annoying than getting deep into a troubleshooting session or escalating issues only to find out you were working with bad data. 😫

That's why we developed our data cleaning feature - so that you can control what makes the cut and avoid having it impact your aggregations and analysis work.

116853621d2e17db6f2598d4310084b2ccleaning.jpg

What is data cleaning?

When data is ingested, Skylight analytics will run it through a filter to tag any suspicious data as "dirty". This bad data is then excluded from computations and visualizations to avoid skewing the results.

Data cleaning rules apply to directional measurements, meaning we'll only mark measurements for a specific direction of P2R, R2P or RT of a session as "dirty", and leave the measurements for other directions as usable when rule thresholds are hit, etc..

1304164899825915e3af4fdac4ccde019image.png

Where do I enable it?

There are two types of data cleaning:

  1. Automatic - this is based on error codes from Accedian sources and is always enabled.
  2. User defined rules - scrub what and how makes sense to your deployment.

Automatic data cleaning

We will tag data as "dirty" based on error codes in the data files sent from Accedian sources. These rules are not configurable. If the error code reported as part of the statStatus is not 0, the result record will be flagged. This applies to DMM, echo-udp. twamp-sf, twamp-sl and eth-dmm session types.

image.png

Refer to Error Codes Documentation for more information on these.

User defined data cleaning rules

Custom cleaning rules can be configurable by admins in the application settings.
image.png

You can add a rule for any metric from any object type. The rule format is that when a metric is either ≥ or ≤ a fixed value for a sustained period of time (minimum of 1 minute), then mark all metrics for that data point and for the same direction as "dirty".

Remember - data cleaning occurs at ingestion time meaning that changes to rules will only apply to records being processed from that time on.


How do I know it's working?

These are the icon you will see to determine data cleaning is off or on

OnOff

Example in action

datacleaning.gif

© 2024 Accedian Networks Inc. All rights reserved. Accedian®, Accedian Networks®,  the Accedian logo™, Skylight™, Skylight Interceptor™ and per-packet intel™, are trademarks or registered trademarks of Accedian Networks Inc. To view a list of Accedian trademarks visit: http://accedian.com/legal/trademarks/. 


Was this article helpful?

Changing your password will log you out immediately. Use the new password to log back in.
First name must have atleast 2 characters. Numbers and special characters are not allowed.
Last name must have atleast 1 characters. Numbers and special characters are not allowed.
Enter a valid email
Enter a valid password
Your profile has been successfully updated.