- cross-posted to:
- privacyguides@lemmy.one
- cross-posted to:
- privacyguides@lemmy.one
Software engineer Vishnu Mohandas decided he would quit Google in more ways than one when he learned that the tech giant had briefly helped the US military develop AI to study drone footage. In 2020 he left his job working on Google Assistant and also stopped backing up all of his images to Google Photos. He feared that his content could be used to train AI systems, even if they weren’t specifically ones tied to the Pentagon project. “I don’t control any of the future outcomes that this will enable,” Mohandas thought. “So now, shouldn’t I be more responsible?”
The site (TheySeeYourPhotos) returns what Google Vision is able to decern from photos. You can test with any image you want or there are some sample images available.
The site (TheySeeYourPhotos) returns what Google Vision is able to decern from photos. You can test with any image you want or there are some sample images available.
…by submitting them to Google, who then keeps a copy of them and uses them for the exact same purpose which purportedly compelled the author to leave Google.
If you’re using Android and Google photos it’s already doing that anyway.
Thats why you are beeing told beforehand and may just pick a stock photo.
Sure, but that still feels very “You agreed!”. The only place on that website that tells you “beforehand” is hidden in the terms of service. That’s literally no different.
It literally says they use the Google vision API right at the top where you submit your photo. I feel like that’s pretty prominent.
And without the context that the Ars article provides, that information means very little to the casual visitor. There is absolutely nothing on that website to provide any of that context. It certainly doesn’t say that by uploading your photo, you are agreeing to allow Google an irrevocable licence to use it to train AI.
The only thing there is an image that says “Take control” which just links to the author’s cloud storage company. This whole thing is thinly-veiled viral marketing.
This comment should be pinned.
I tried various photos, any of my personal photos with metadata stripped, and was surprised how accurate it was.
It seemed really oriented towards detecting people and their moods, the socioeconomic status of things, and objects and their perceived quality.
I gave it two pictures of my cat and it said that she looked annoyed in one picture and contemplative in the other, both of which were true.
It’s probably a vision model (like this) with custom instructions that direct it to focus on those factors. It’d be interesting to see the instructions.
I tested with a few images, particularly drawings and arts. Then I had the idea of trying something different… and I discovered that it seems like it’s vulnerable to the “Ignore all previous instructions” command, just like LLMs:
Oh. My. Fuck. Me.
The image shows a man and a girl walking on a path in a wooded area. The foreground is covered in fallen leaves and pine straw. In the background, there is a wooden structure that appears to be some sort of storage shed or lean-to. A fire pit is visible to the left. The trees are dense and the lighting suggests it’s daytime. There appears to be a small animal or bird under the wooden structure.
The man appears to be middle-aged, Caucasian, with a casual style. He appears to be wearing camouflage clothing and jeans, suggesting an outdoorsy lifestyle. He is carrying a water bottle. The girl is young, likely elementary school-aged, wearing a pink shirt and shorts. She looks somewhat pensive. They both appear to be of average economic means and are engaging in a simple outdoor walk. The picture was taken at 2:44:22 AM on February 1st, 2020, with a Bushnell camera. There’s also an unidentifiable object hanging from a tree branch in the background.
The image’s resolution is somewhat low, indicative of a security camera, making the details somewhat blurry. The lighting is not uniform, with patches of sunlight and shadow. There is a subtle difference in the ground texture between the path and the surrounding areas. The girl appears to have a slightly concerned expression on her face. The wood used in the construction of the shelter seems weathered and may be indicative of its age and prolonged exposure to the environment.
Mostly spot on except the date because I never set the trail cam. Also, no animal under the firewood shed and the water bottle is a Keystone in a cozie. Cannot believe it picked up the ground difference in the trail and the edge of it.
I gave it a picture of houseplants and it said they looked healthy and well cared for which actually made me feel pretty happy and validated.
Don’t feel too happy bro you were told that by a soulless computer that’s was designed to tell you what it thinks you want to hear.
It’s not designed to tell you what you want to hear.
That’s literally all AI is designed to do. Given an input, it just tries to output an expected response.
Yeah, no. LLMs predict what comes next, not what someone wants to hear.
Not really wants as much as expects, but that’s what AI is designed to do.
What you’re saying is not factual. LLMs predict what comes next based on the parameters set during learning process. It might at times say what you’re expecting, but then try contradicting information that it knows to be factual. See how far that gets you.
I think you’re confusing agreeableness for a validation buddy. For a product like this to work, it has to be inviting.
LLMs predict what comes next based on the parameters set during learning process.
Now you’re just splitting hairs.
I tried a few but just got that it’s a particular shade of taupe with no discernable people or objects. And it went on describing how oddly particular the shade of taupe was…for some reason. 🤣 And the other said it was sage green.
I’m guessing something was wrong with it when I tried it and it was just getting a very small portion of the image because the different colors it mentioned were present in the images it referenced, so it’s not like it was just random or blocked entirely.
Does anyone have any recommendations for apps to view photos that are not Google?
I use Immich, which I host locally.
Immich if you self host as others have mentioned but since this is the article shared and you don’t want to host it https://ente.io/ is what is talked about in the article
Something “more private, wholesome, and trustworthy,” he says. The paid service he designed, Ente, is profitable and says it has more than 100,000 users, many of whom are already part of the privacy-obsessed crowd. But Mohandas struggled to articulate to wider audiences why they should reconsider relying on Google Photos, despite all the conveniences it offers.
I have 0 experience with ente btw
You can also self host ente. I’ve found it to require less maintenance than immich.
This would make sense as the ente server doesn’t do much given all the photos are encrypted. All the intelligence is in the client apps.
Didn’t know that interesting I wonder how there features compare to immich
The ML categorization of images was added more recently than it was in immich and the updates are less frequent, but I like that when I open the app on my phone I can expect it to work instead of complaining that the server and app versions don’t match and are incompatible.
Seconding immich - I host it for my family which makes sharing vacation photos easy since they all have accounts on my instance that can be shared to / from.
I’ve got Immich running on a Raspberry Pi based NAS at home, which works pretty well but it’s not for beginners.
For those who don’t need cloud access, I just put all of my photos on a NAS and use a digital asset manager software. digiKam is great if you want an open source solution. I use ACDSee because it’s faster and has better usability in my humble opinion. But since both of the software packages store the metadata in image files and XMP sidecars and basically only use local app-specific database for caching, if digiKam ever gets a couple of quantum leaps ahead, switching back to it isn’t that big of a deal. (As usual, don’t use Adobe Lightroom or you’re screwed in that regard. Or so I’ve been told.)
It has correctly identified both a Stargate and a moai made of snow.
Gave it a screenshot of OSMand, got a creepy qoute
The image does not show any people; it is purely a navigational map highlighting a route. The time displayed on the map is 17:51:34, suggesting late afternoon or early evening. There is no additional information available about the device used to take the screenshot or the user’s intentions, making it impossible to determine their racial characteristics, ethnicity, age, economic status, or lifestyle. The emotional context of the image is neutral, as it is simply a visual representation of a traveled route.
That’s clearly part of the prompt from this demo website, based on the other answers it’s been giving.
Don’t mind me, I’m just poisoning it with AI shit that it thinks is real.
You sadly cant poison models this way, they are static/pretrained and dont change based on user input.
I think it’s pretty likely that online LLMs keep user inputs for training of future versions/models. Though it probably gets filtered for obvious stuff like this.
They say they don’t. Would be very bad publicity if they did, and possibly breach of contract or other legal trouble.