xnx
on Dec 5, 2024
on: PaliGemma 2: Powerful Vision-Language Models, Simp...
Yes: "The initial four location tokens represent the coordinate of the bounding box, ranging from 0 to 1023. These coordinates are independent of the aspect ratio, as the image is assumed to be resized to 1024 x 1024."
https://developers.googleblog.com/en/gemma-explained-paligem...
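For anyone wanting to try this, here is a rough sketch of mapping those four location values back to pixel coordinates on the original image. It relies on the 0 to 1023 range and the 1024 x 1024 assumption from the quote. The `<locNNNN>` token spelling, the `y_min, x_min, y_max, x_max` order, and dividing by 1024 (rather than 1023) are assumptions not stated in the quote, so check them against the linked post before relying on this. The function name `loc_tokens_to_pixels` is made up for illustration.

```python
import re

# Assumed token spelling: four zero-padded digits, e.g. "<loc0256>".
LOC_PATTERN = re.compile(r"<loc(\d{4})>")

# Per the quoted text, coordinates refer to a 1024 x 1024 resized image.
GRID = 1024


def loc_tokens_to_pixels(text, width, height):
    """Convert the first four location tokens in `text` to pixel coordinates.

    Returns (x_min, y_min, x_max, y_max) in the original image's pixels.
    """
    values = [int(v) for v in LOC_PATTERN.findall(text)[:4]]
    if len(values) != 4:
        raise ValueError("expected four location tokens")

    # Assumed order; swap here if the model emits x before y.
    y_min, x_min, y_max, x_max = values

    # The coordinates ignore aspect ratio, so each axis is rescaled
    # independently by the original image's width and height.
    return (
        x_min / GRID * width,
        y_min / GRID * height,
        x_max / GRID * width,
        y_max / GRID * height,
    )


if __name__ == "__main__":
    # Hypothetical model output for a 2048 x 1024 image.
    print(loc_tokens_to_pixels("<loc0256><loc0512><loc0768><loc1023> cat", 2048, 1024))
```

Because the box is expressed on a square grid regardless of the input shape, the x and y scale factors differ for non-square images, which is why width and height are passed separately.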
exe34
on Dec 5, 2024
Thank you! Will have a play with that.