mcp-hfspace
Votes: 0
In this example, we specify the filename for microsoft/OmniParser to use and get back an annotated image and two separate pieces of text: descriptions and coordinates. The prompt used was "use omniparser to analyse ./screenshot.png and use the analysis to produce an artifact that reproduces that screen". DawnC/Pawmatch is also good at this.
GitHub: https://github.com/evalstate/mcp-hfspace
Language: TypeScript
License: MIT
Official: No
Categories:
Local, Image & Video Processing, Speech Processing