Home Data Prep Q&A

Paxata Community Members: Something special in a community experience is coming your way. Stay tuned to this space.
In the meantime, check out the brand new Data Prep for Data Science topic here and the new DataRobot Community.

Visit the official Paxata Documentation portal for all of your doc needs.

Question regarding cluster + edit feature (ngram algorithm)

Hello,

I don't know how values are grouped when using ngram.

Could you please tell me that with simple sample data?

Best Regards,
Momoko

Best Answer

Answers

  • Momoko FukudaMomoko Fukuda Posts: 2
    edited April 2, 2020 5:39AM
    Hi,

    My apologies for the delay response and I appreciate your answer.
    Please let me ask you another question.
    I tried testing with the data below;

    test
    aaabcde
    fgaaahij
    klmnoaaa
    opraastu

    I set "3" at "NGram Size" as parameter and I thought the three values "aaabcde", "fgaaahij" and "klmnoaaa" could be grouped because they have "aaa", but it didn't work.
    What kind of data can I group with? or should I change the parameter?

    Best Regards,
    Momoko
Sign In or Register to comment.