Visokio website     Downloads     Video tutorials     KnowledgeBase  
de-duplicate: How to define the 'younger' entry to be kept / the older entry to be deleted? - Visokio Forums
de-duplicate: How to define the 'younger' entry to be kept / the older entry to be deleted?
  •     dvo_D December 14, 2010 2:58PM
    There are data sets to be updated.
    How can I tell the system to identify the 'old' data by entries in defined columns, so Omniscope will replace the complete row with a newer entry from another source? - 'Age' of data is to be identified by a column which is included in data from the 2nd source "corrections", but not in the 1st source "main data".
    For better understanding, please take a look on attached xls file.

    The data set to be kept is green,
    to be deleted: orange,
    Facts to identify the duplicate: light blue,
    Facts to identify the younger entry: dark blue,

    Source is the controlled circulation reporting (quarterly) of print titles and its correction (weekly updated) by "IVW.de".
  • 1 Comment
  •     steve January 10, 2011 2:21PM
    I presume by the presence of the Source column you have merged two blocks in DataManager to produce this result.

    Using DataManager, add a Sort operation before a De-duplicate operation. See the attached file.

    If you can't get it to work with larger amounts of data, you might need to use the Replace operation to change the blank value to some imaginary date way in the past, before doing the Sort + De-dup.
    Attachments
    De-dup order.iok 17K

Welcome!

It looks like you're new here. If you want to get involved, click one of these buttons!

Sign In Apply for Membership