An ES (JavaScript & TypeScript) module to dissect the string; Safe with the emojis, URLs, and words.

String Dissect (ES)

βš–οΈ MIT

GitHub: hugoalh/string-dissect-es JSR: @hugoalh/string-dissect NPM: @hugoalh/string-dissect

🔰 Begin

🎯 Targets

| Target | Remote | JSR | NPM |
|:--|:-:|:-:|:-:|
| Bun >= v1.1.0 | ❌ | ❓ | ✔️ |
| Cloudflare Workers | ❌ | ❓ | ✔️ |
| Deno >= v1.42.0 | ✔️ | ✔️ | ✔️ |
| NodeJS >= v20.9.0 | ❌ | ❓ | ✔️ |

Note

  • This module may also work in ways not listed here; however, those ways are not officially supported and may cause security issues.

#️⃣ Resource Identifiers

  • Remote - GitHub Raw:
    https://raw.githubusercontent.com/hugoalh/string-dissect-es/{Tag}/mod.ts
    
  • JSR:
    [jsr:]@hugoalh/string-dissect[@{Tag}]
    
  • NPM:
    [npm:]@hugoalh/string-dissect[@{Tag}]
    

Note

  • For remote resources, it is recommended to import the entire module through the main path mod.ts. Part of the module can also be imported through a sub path if available, but do not import an element if:

    • its path has an underscore prefix (e.g.: _foo.ts, _util/bar.ts), or
    • it is a benchmark or test file (e.g.: foo.bench.ts, foo.test.ts), or
    • its symbol has an underscore prefix (e.g.: _bar, _foo).

    These elements are not considered part of the public API, thus no stability is guaranteed for them.

  • For JSR or NPM resources, it is recommended to import the entire module through the main entrypoint. Part of the module can also be imported through a sub entrypoint if available; see the exports property in the file jsr.jsonc for the available sub entrypoints.

  • It is recommended to pin this module to a tag for immutability.

πŸ›‘οΈ Runtime Permissions

This module does not request any runtime permission.

🧩 APIs

  • class StringDissector {
      constructor(options: StringDissectorOptions = {});
      dissect(item: string): Generator<StringSegmentDescriptor>;
    }
  • interface StringDissectorOptions {
      locales?: Intl.LocalesArgument;
      outputANSI?: boolean;
      safeURLs?: boolean;
      safeWords?: boolean;
    }
  • interface StringSegmentDescriptor {
      indexEnd: number;
      indexStart: number;
      type: StringSegmentType;
      value: string;
    }
  • type StringSegmentType = "ansi" | "character" | "emoji" | "url" | "word";
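
Since dissect() returns a generator of descriptors, the result can be consumed with any iterable-aware code. The sketch below is one possible way to do that: it tallies segments by their type field. The SegmentLike interface is a local stand-in that copies only two of the documented descriptor fields, tallySegmentTypes is a hypothetical helper that is not part of this module, and the sample data is hand-written rather than produced by the module.

```typescript
// Local stand-in for the documented descriptor shape (only the fields used here).
interface SegmentLike {
  type: string;
  value: string;
}

// Hypothetical helper, not part of the module: count how many segments of each type appear.
function tallySegmentTypes(segments: Iterable<SegmentLike>): Map<string, number> {
  const tally = new Map<string, number>();
  for (const { type } of segments) {
    tally.set(type, (tally.get(type) ?? 0) + 1);
  }
  return tally;
}

// Hand-written sample data (an assumption for illustration, not real module output).
const sampleSegments: SegmentLike[] = [
  { value: "Hello", type: "word" },
  { value: " ", type: "character" },
  { value: "https://example.com", type: "url" },
  { value: " ", type: "character" },
  { value: "world", type: "word" }
];

const segmentTally = tallySegmentTypes(sampleSegments);
console.log(segmentTally);
```

With the module imported, the same helper should accept new StringDissector().dissect(item) directly, because a generator is also an iterable.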

✍️ Examples

  • const sample1 = "Vel ex sit est sit est tempor enim et voluptua consetetur gubergren gubergren ut.";
    
    /* Either */
    Array.from(new StringDissector().dissect(sample1));
    Array.from(dissectString(sample1));
    /*=>
    [
      { value: "Vel", type: "word" },
      { value: " ", type: "character" },
      { value: "ex", type: "word" },
      { value: " ", type: "character" },
      { value: "sit", type: "word" },
      { value: " ", type: "character" },
      { value: "est", type: "word" },
      { value: " ", type: "character" },
      ... +20
    ]
    */
    
    /* Either */
    Array.from(new StringDissector({ safeWords: false }).dissect(sample1));
    Array.from(dissectString(sample1, { safeWords: false }));
    /*=>
    [
      { value: "V", type: "character" },
      { value: "e", type: "character" },
      { value: "l", type: "character" },
      { value: " ", type: "character" },
      { value: "e", type: "character" },
      { value: "x", type: "character" },
      { value: " ", type: "character" },
      { value: "s", type: "character" },
      ... +73
    ]
    */
  • /* Either */
    Array.from(new StringDissector().dissect("GitHub homepage is https://github.com."));
    Array.from(dissectString("GitHub homepage is https://github.com."));
    /*=>
    [
      { value: "GitHub", type: "word" },
      { value: " ", type: "character" },
      { value: "homepage", type: "word" },
      { value: " ", type: "character" },
      { value: "is", type: "word" },
      { value: " ", type: "character" },
      { value: "https://github.com", type: "url" },
      { value: ".", type: "character" }
    ]
    */
  • /* Either */
    Array.from(new StringDissector().dissect("🤝💑💏👪👨‍👩‍👧‍👦👩‍👦👩‍👧‍👦🧑‍🤝‍🧑"), ({ value }) => {
      return value;
    });
    Array.from(dissectString("🤝💑💏👪👨‍👩‍👧‍👦👩‍👦👩‍👧‍👦🧑‍🤝‍🧑"), ({ value }) => {
      return value;
    });
    //=> [ "🤝", "💑", "💏", "👪", "👨‍👩‍👧‍👦", "👩‍👦", "👩‍👧‍👦", "🧑‍🤝‍🧑" ]
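
One common reason to dissect a string this way is to shorten it without cutting through a word, URL, or emoji. The following is a minimal sketch of that idea, not something the module provides: truncateBySegments is a hypothetical helper, it measures length in UTF-16 code units (String.prototype.length), and the segment list is hand-copied from the documented output of the GitHub homepage example above.

```typescript
// Hypothetical helper, not part of the module: keep whole segments until the
// next one would push the result past maxLength (counted in UTF-16 code units).
function truncateBySegments(segments: Iterable<{ value: string }>, maxLength: number): string {
  let result = "";
  for (const { value } of segments) {
    if (result.length + value.length > maxLength) {
      break;
    }
    result += value;
  }
  return result;
}

// Segments copied by hand from the documented output for
// "GitHub homepage is https://github.com."
const homepageSegments = [
  { value: "GitHub", type: "word" },
  { value: " ", type: "character" },
  { value: "homepage", type: "word" },
  { value: " ", type: "character" },
  { value: "is", type: "word" },
  { value: " ", type: "character" },
  { value: "https://github.com", type: "url" },
  { value: ".", type: "character" }
];

// The URL does not fit into the remaining budget, so it is dropped whole.
const shortened = truncateBySegments(homepageSegments, 20);
console.log(JSON.stringify(shortened));
```

A plain slice(0, 20) on the same input would end inside the URL; working on segments avoids that at the cost of sometimes returning fewer characters than the limit allows.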
