![](https://lemmy.dbzer0.com/pictrs/image/35225f3a-d729-4f92-86d4-89b4d85b7c86.jpeg)
![](https://fry.gs/pictrs/image/c6832070-8625-4688-b9e5-5d519541e092.png)
Yes, that’s why I’m proposing it instead of relying on a single pixel to tell ad from video. YouTube videos are already split into sections, so just add some metadata with a hash to each one.
That is prone to error; a single pixel can be too small a sample. I would prefer something with hashes, say a sha1sum of the current frame every 5 seconds. It can be computed while the video is buffering, and the player can wait until the ad is over to splice in the correct region.
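A rough sketch of the hash idea, under a few assumptions that are mine and not anything YouTube provides: frames are plain byte strings, the ad-free video ships a list of SHA-1 checkpoint hashes (one every `step` frames, standing in for "every 5 seconds"), ads are only spliced in at checkpoint boundaries, and frames arrive byte-identical to the ones that were hashed. The names `checkpoint_hashes` and `strip_ads` are made up for illustration.

```python
import hashlib


def sha1_hex(frame: bytes) -> str:
    """SHA-1 digest of one frame's raw bytes."""
    return hashlib.sha1(frame).hexdigest()


def checkpoint_hashes(frames, step):
    """Hash every `step`-th frame of the ad-free video (hypothetical metadata)."""
    return [sha1_hex(frames[i]) for i in range(0, len(frames), step)]


def strip_ads(stream, checkpoints, step):
    """Keep only the frames that line up with the checkpoint hashes.

    Assumption: ads are inserted only at checkpoint boundaries, so a
    mismatch at a checkpoint means we are inside an ad and should drop
    frames until the expected frame shows up.
    """
    kept = []
    for frame in stream:
        pos = len(kept)  # index this frame would have in the real video
        if pos % step == 0:
            idx = pos // step
            if idx >= len(checkpoints) or sha1_hex(frame) != checkpoints[idx]:
                continue  # not the expected frame: treat it as ad and drop it
        kept.append(frame)
    return kept


# Toy data: 12 "frames" of video, a checkpoint every 4 frames, 3 ad frames.
video = [f"video-{i}".encode() for i in range(12)]
ad = [b"ad-0", b"ad-1", b"ad-2"]
stream = video[:4] + ad + video[4:]

checkpoints = checkpoint_hashes(video, step=4)
cleaned = strip_ads(stream, checkpoints, step=4)
print(cleaned == video)
```

In practice a re-encoded stream would not be byte-identical, so an exact hash like SHA-1 would probably need to be swapped for something more tolerant; this only shows the splice logic.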
It will make it possible to do the TTC glitch consistently, saving half an A press
Then you’re forgetting about local models: they can’t generate text as polished as hosted ones, but they won’t have the watermark
It happens to me sometimes, but you can hop instances and it usually ends up working
Thank you!
Last I saw, you had to compile it from source. Can you drop a link?
Yes, that could be an alternative to computing hashes. I don’t know which option would be less resource-intensive