| commit | 737570847f2c6b0a27266aa00662308f19f4de60 | [log] [tgz] |
|---|---|---|
| author | Philip Zeyliger <philip@bold.dev> | Sat Jun 07 00:06:27 2025 +0000 |
| committer | Philip Zeyliger <philip@bold.dev> | Sat Jun 07 04:12:57 2025 +0000 |
| tree | c4bc73898f0d8be2723efd00866f4e53eb279532 | |
| parent | 5f778949e023268d0bd39cb7f102952e575e9602 [diff] |
webui: add image paste functionality to mobile chat input Add comprehensive file paste support to mobile input matching desktop functionality with upload progress indicators and error handling. Problem Analysis: Mobile webui lacked image paste functionality available on desktop, preventing users from easily sharing screenshots, images, and files via clipboard paste. This created workflow friction for mobile users who needed to upload images for code assistance, debugging, or documentation purposes. Implementation Changes: 1. Clipboard Paste Handler: - Added paste event listener for detecting clipboard files - Integrated with existing upload infrastructure via fetch API - Cursor position preservation for inline file insertion - Prevented default paste behavior for file content 2. File Upload Infrastructure: - Reused desktop upload patterns with FormData and ./upload endpoint - Added loading placeholder with emoji indicator during upload - Implemented error handling with user-friendly error messages - File path insertion in [bracket] format matching desktop behavior 3. Upload Progress Management: - Added upload counter to track multiple concurrent uploads - Upload progress indicator with file count display - Disabled send button during active uploads to prevent premature sending - Visual feedback with hourglass icon and tooltip messages 4. Mobile-Optimized UI: - Positioned upload progress indicator above input field - Touch-friendly styling with appropriate spacing - Responsive design for various mobile screen sizes - Integrated seamlessly with existing mobile input styling 5. Lifecycle Management: - Proper event listener setup and cleanup - Component disconnection cleanup to prevent memory leaks - Textarea reference management for event binding - Focus restoration after upload completion Technical Details: - Uses ClipboardEvent.clipboardData.files for file detection - FormData upload with POST to ./upload endpoint matching desktop - Upload state management with progress counter and visual indicators - Cursor position tracking via selectionStart for inline insertion - Error handling with try/catch and user feedback Benefits: - Consistent file upload experience across desktop and mobile - Improved mobile workflow for image sharing and file uploads - Visual feedback during upload process prevents user confusion - Proper error handling with informative messages - Seamless integration with existing mobile chat functionality Testing: - Verified paste event detection works on mobile browsers - Confirmed upload progress indicators display correctly - Tested error handling with network failures - Validated file path insertion and cursor positioning This enhancement brings mobile webui file paste functionality to feature parity with desktop while maintaining mobile-specific optimizations. Co-Authored-By: sketch <hello@sketch.dev> Change-ID: s495601fcaa89f012k
Sketch is an agentic coding tool. It draws the 🦉
Sketch runs in your terminal, has a web UI, understands your code, and helps you get work done. To keep your environment pristine, sketch starts a docker container and outputs its work onto a branch in your host git repository.
Sketch helps with most programming environments, but Sketch has extra goodies for Go.
go install sketch.dev/cmd/sketch@latest sketch
Currently, Sketch runs on macOS and Linux. It uses Docker for containers.
| Platform | Installation |
|---|---|
| macOS | brew install colima (or Docker Desktop/Orbstack) |
| Linux | apt install docker.io (or equivalent for your distro) |
| WSL2 | Install Docker Desktop for Windows (docker entirely inside WSL2 is tricky) |
The sketch.dev service is used to provide access to an LLM service and give you a way to access the web UI from anywhere.
Start Sketch by running sketch in a Git repository. It will open your browser to the Sketch chat interface, but you can also use the CLI interface. Use -open=false if you want to use just the CLI interface.
Ask Sketch about your codebase or ask it to implement a feature. It may take a little while for Sketch to do its work, so hit the bell (🔔) icon to enable browser notifications. We won't spam you or anything; it will notify you when the Sketch agent's turn is done, and there's something to look at.
When you start Sketch, it:
This design lets you run multiple sketches in parallel since they each have their own sandbox. It also lets Sketch work without worry: it can trash its own container, but it can't trash your machine.
Sketch's agentic loop uses tool calls (mostly shell commands, but also a handful of other important tools) to allow the LLM to interact with your codebase.
Sketch is trained to make Git commits. When those happen, they are automatically pushed to the git repository where you started sketch with branch names sketch/*.
Finding Sketch branches:
git branch -a --sort=creatordate | grep sketch/ | tail
The UI keeps track of the latest branch it pushed and displays it prominently. You can use standard Git workflows to pull those branches into your workspace:
git cherry-pick $(git merge-base origin/main sketch/foo)
or merge the branch
git merge sketch/foo
or reset to the branch
git reset --hard sketch/foo
Ie use the same workflows you would if you were pulling in a friend's Pull Request.
Advanced: You can ask Sketch to git fetch sketch-host and rebase onto another commit. This will also fetch where you started Sketch, and we do a bit of "git fetch refspec configuration" to make origin/main work as a git reference.
Don't be afraid of asking Sketch to help you rebase, merge/squash commits, rewrite commit messages, and so forth; it's good at it!
The diff view shows you changes since Sketch started. Leaving comments on lines adds them to the chat box, and, when you hit Send (at the bottom of the page), Sketch goes to work addressing your comments.
You can interact directly with the container in three ways:
ssh sketch-ilik-eske-tcha-lott. We have automatically configured your SSH configuration to make these special hostnames work.Using SSH (and/or VSCode) allows you to forward ports from the container to your machine. For example, if you want to start your development webserver, you can do something like this:
# Forward container port 8888 to local port 8000 ssh -L8000:localhost:8888 sketch-ilik-epor-tfor-ward go run ./cmd/server
This makes http://localhost:8000/ on your machine point to localhost:8888 inside the container.
You can ask Sketch to browse a web page and take screenshots. There are tools both for taking screenshots and "reading images", the latter of which sends the image to the LLM. This functionality is handy if you're working on a web page and want to see what the in-progress change looks like.
Docker images, containers, and so forth tend to pile up. Ask Docker to prune unused images and containers:
docker system prune -a
See CONTRIBUTING.md for development guidelines.
Sketch is open source. It is right here in this repository! Have a look around and mod away.
If you want to run Sketch entirely without the sketch.dev service, you can set the flag -skaband-addr="" and then provide an ANTHROPIC_API_KEY environment variable. (More LLM services coming soon!)