System and method for data compaction and encryption of anonymized data records

DWPI Title: System for facilitating data compaction and encryption of anonymized data records, has library manager for comprising multiple programming instructions stored in memory, and processor for receiving anonymized source blocks from data deconstruction engine for each sourceblock
Abstract: A system and method for data compaction and encryption of anonymized data records. A dataset may be pre-processed by dividing into sourceblocks at reasonable intervals and tallying each sourceblock's frequency, creating a tally record of tokens and count values. This tally record may then be anonymized and transmitted to a data deconstruction engine which combined with a library manager creates a codebook and performs optimization techniques on the codebook. The data deconstruction engine and library manager may be distributed across multiple nodes or devices. The received anonymized tally record may be parsed into individual tokens by identifying the tokens with the highest count value. The tokens may then be sent descending order of count value to the library manger where each token may be assigned a codeword. A half-backed codebook is then created using the tokens and each token's unique codeword, before sending the half-backed codebook to a system user.
Use: System for facilitating data compaction and encryption of anonymized data records.
Advantage: The system enables progressive optimization, parallel processing, and efficient resource utilization, makes the system scalable, adaptable, and modular, allows handle larger datasets and complex optimization requirements, reduces size for efficient operation, eliminates the least-probable entries from word library up to a given threshold so as to allow data gathering or generating devices to process and train on data gathered locally, which improve system latency and optimize available network resources.
Novelty: The system has a processor for receiving an anonymized tally record from a user, where anonymized tally record comprises multiple anonymized source blocks for each anonymized source block and a tally value indicating the number of times. The processor parses anonymized source blocks to identify anonymized source blocks with highest tally value, sends the anonymized source blocks to a library manager (103), receives a half-backed code book from the library manager, and transmits half-backed code book to a user. The library manager comprises multiple programming instructions stored in a memory. The processor receives anonymized source blocks from a data deconstruction engine for each source block.
Filed: 6/7/2024
Application Number: US18737474A
Tech ID: SD 14754.1
This invention was made with Government support under Contract No. DE-NA0003525 awarded by the United States Department of Energy/National Nuclear Security Administration. The Government has certain rights in the invention.
Data from Derwent World Patents Index, provided by Clarivate
All rights reserved. Republication or redistribution of Clarivate content, including by framing or similar means, is prohibited without the prior written consent of Clarivate. Clarivate and its logo, as well as all other trademarks used herein are trademarks of their respective owners and used under license.