mcp-hfspace

Votes: 0

In this example, we specify the filename for microsoft/OmniParser to use, and get returned an annotated Image and 2 separate pieces of text: descriptions and coordinates. The prompt used was use omniparser to analyse ./screenshot.png and use the analysis to produce an artifact that reproduces that screen. DawnC/Pawmatch is also good at this.

GitHub: https://github.com/evalstate/mcp-hfspace

Language: Typescript

License: MIT

Official: No

Categories:

LocalImage & Video ProcessingSpeech Processing