- Mark as New
- Subscribe to RSS Feed
- Permalink
‎27 Mar 2024 02:40 PM
Hello!
I have log lines when a customer makes a purchase. This log contains the ID of the purchased products. I would like to be able to quantify the best-selling products of the day.
Example:
message: TUP15451, TUP41234, TUP52351152.
It's possible?
Solved! Go to Solution.
- Mark as New
- Subscribe to RSS Feed
- Permalink
‎27 Mar 2024 04:17 PM - edited ‎27 Mar 2024 04:17 PM
Considering the productID is a field, you can do "| summarize count(), by: {productID} " at the end of your query.
- Mark as New
- Subscribe to RSS Feed
- Permalink
‎27 Mar 2024 05:23 PM
but I have multiple products, something like: "PRODUCTS: TPU 123, TPU 1942, TPU9724"
- Mark as New
- Subscribe to RSS Feed
- Permalink
‎27 Mar 2024 05:26 PM
You mean, in the same line you have multiple values for the productID field?
- Mark as New
- Subscribe to RSS Feed
- Permalink
‎27 Mar 2024 05:33 PM
Yes!!
I'm almost there!What's left is to refine my parseAll because it's not getting the last value
@dannemca wrote:You mean, in the same line you have multiple values for the productID field?
data record(timestamp = "12/24/2022", content = "Products: SKU1234, SKU1234, SKU12345"),
record(timestamp = "12/25/2022", content = "Products: SKU123, SKU1234, SKU12345")
| fieldsAdd ProductID = parseAll(content, "BLANK LD:pid PUNCT")
| expand ProductID
| summarize count(), alias: Occurrences, by:ProductID
- Mark as New
- Subscribe to RSS Feed
- Permalink
‎27 Mar 2024 05:47 PM
Got it now, try to replace the LD by WORD in your parseAll:
data record(timestamp = "12/24/2022", content = "Products: SKU1234, SKU1234, SKU12345"),
record(timestamp = "12/25/2022", content = "Products: SKU123, SKU1234, SKU12345")
| fieldsAdd ProductID = parseAll(content, "BLANK WORD:pid")
| expand ProductID
| summarize count(), alias: Occurrences, by:ProductID
Regards.
- Mark as New
- Subscribe to RSS Feed
- Permalink
‎28 Mar 2024 01:32 PM
We received the official registration format. Perhaps it would be possible to adapt parse to capture what is just between {[[ and ]]}?
I tried with DATA "{[[" but I was unsuccessful
PARITY_CATEGORY and fields: {[[TKU12345, TKU123456, TKU123451, TKU59156]]}
- Mark as New
- Subscribe to RSS Feed
- Permalink
‎28 Mar 2024 06:22 PM
If you have comma separated array of words it quite easy to do using just DPL or DPL + splitString:
data record(content="blabla blabla Products: {[[TKU12345, TKU123456, TKU123451, TKU59156]]} bla bla")
| parse content, "DATA '{[[' LD:products ']]}' DATA"
| fieldsAdd product=trim(splitString(products,",")[])
or
data record(content="blabla blabla Products: {[[TKU12345, TKU123456, TKU123451, TKU59156]]} bla bla")
| parse content, "DATA '{[[' ARRAY{WORD*:i ', '*}{1,100}:product ']]}' DATA"
Kris
- Mark as New
- Subscribe to RSS Feed
- Permalink
‎28 Mar 2024 08:05 PM
| parse content, "DATA '{[[' ARRAY{WORD*:i ', '*}{1,100}:product ']]}' DATA"
this worked great! Thank you very much!
Could you explain to me how it works?