BARALDI, LORENZO
Geographic distribution
Continent #
NA - North America 14,283
EU - Europe 9,516
AS - Asia 8,680
SA - South America 850
AF - Africa 109
OC - Oceania 58
Unknown continent - Continent information not available 22
Total 33,518
Country #
US - United States of America 13,999
IT - Italy 4,271
SG - Singapore 2,734
CN - China 2,401
GB - United Kingdom 1,682
HK - Hong Kong 1,284
DE - Germany 815
BR - Brazil 709
SE - Sweden 631
VN - Vietnam 543
KR - Korea 335
RU - Russian Federation 326
FR - France 289
JP - Japan 277
FI - Finland 263
ID - Indonesia 241
NL - Netherlands 214
IN - India 181
CA - Canada 170
UA - Ukraine 158
ES - Spain 149
TR - Turkey 136
TW - Taiwan 109
AT - Austria 101
IE - Ireland 99
BG - Bulgaria 89
MX - Mexico 85
BE - Belgium 67
PL - Poland 63
MY - Malaysia 61
BD - Bangladesh 57
CH - Switzerland 57
AR - Argentina 56
TH - Thailand 51
AU - Australia 50
RO - Romania 42
ZA - South Africa 39
LT - Lithuania 37
GR - Greece 35
DK - Denmark 31
IL - Israel 30
PK - Pakistan 30
AE - United Arab Emirates 26
PT - Portugal 26
IQ - Iraq 25
PH - Philippines 20
EC - Ecuador 19
EU - Europe 19
IR - Iran 19
CL - Chile 18
SA - Saudi Arabia 18
CZ - Czech Republic 17
UZ - Uzbekistan 16
DZ - Algeria 14
KZ - Kazakhstan 13
CO - Colombia 12
VE - Venezuela 12
MA - Morocco 11
KE - Kenya 10
PE - Peru 10
TN - Tunisia 10
AZ - Azerbaijan 9
PY - Paraguay 9
BZ - Belize 8
EG - Egypt 8
LU - Luxembourg 8
JO - Jordan 7
KH - Cambodia 7
NZ - New Zealand 7
SC - Seychelles 7
SK - Slovakia (Slovak Republic) 7
HU - Hungary 6
NO - Norway 6
NP - Nepal 6
BH - Bahrain 5
EE - Estonia 5
MO - Macao, Special Administrative Region of China 5
AL - Albania 4
CY - Cyprus 4
HR - Croatia 4
JM - Jamaica 4
KG - Kyrgyzstan 4
RS - Serbia 4
SN - Senegal 4
SY - Syrian Arab Republic 4
BB - Barbados 3
DO - Dominican Republic 3
ET - Ethiopia 3
GE - Georgia 3
LB - Lebanon 3
LK - Sri Lanka 3
LV - Latvia 3
MD - Moldova 3
PS - Palestinian Territory 3
CR - Costa Rica 2
GT - Guatemala 2
MN - Mongolia 2
PA - Panama 2
QA - Qatar 2
UY - Uruguay 2
Total 33,493
City #
Singapore 1,718
Fairfield 1,362
Santa Clara 1,322
Ashburn 1,318
Hong Kong 1,075
Southend 958
Hefei 859
Modena 856
Chandler 774
Woodbridge 668
Seattle 608
Houston 593
Cambridge 520
Wilmington 442
Beijing 425
Ann Arbor 384
San Jose 351
London 342
Nyköping 342
Los Angeles 288
Milan 257
Bologna 247
Dearborn 247
Jacksonville 235
Jakarta 211
Seoul 197
Chicago 192
Buffalo 183
Ho Chi Minh City 179
Rome 176
New York 160
Boardman 158
Helsinki 141
Reggio Emilia 138
Council Bluffs 135
San Diego 134
Tokyo 130
Munich 123
The Dalles 119
Parma 109
Hanoi 105
Nuremberg 98
Shanghai 90
Princeton 89
Sofia 84
Redwood City 79
São Paulo 74
Kent 73
Dublin 72
Izmir 72
Amsterdam 71
Dong Ket 69
Phoenix 62
Dallas 61
Moscow 61
Eugene 59
Florence 59
Frankfurt am Main 56
Salt Lake City 56
Montreal 51
Bomporto 50
Taipei 50
Bremen 49
Pisa 47
Vienna 47
Bangkok 46
Formigine 45
Naples 44
Orem 43
Paris 42
Warsaw 42
Mexico City 41
Piacenza 39
Zurich 38
Brussels 37
Fremont 36
Manchester 35
Chennai 34
Lappeenranta 34
Ottawa 34
Toronto 33
Falls Church 32
Trento 32
Wilmette 32
Kuala Selangor 31
Tampa 31
Düsseldorf 30
Lauterbourg 29
Nanjing 29
Falkenstein 28
Portsmouth 28
San Francisco 28
Turin 28
Central 27
Denver 27
Johannesburg 27
Turku 27
Casalgrande 26
Guangzhou 26
Poplar 26
Total 21,527
Name #
Attentive Models in Vision: Computing Saliency Maps in the Deep Learning Era 496
MissRAG: Addressing the Missing Modality Challenge in Multimodal Large Language Models 455
Connected Components Labeling on DRAGs 454
Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models 453
Spaghetti Labeling: Directed Acyclic Graphs for Block-Based Connected Components Labeling 439
What was Monet seeing while painting? Translating artworks to photo-realistic images 434
Artpedia: A New Visual-Semantic Dataset with Visual and Contextual Sentences in the Artistic Domain 432
Attentive Models in Vision: Computing Saliency Maps in the Deep Learning Era 425
Modeling Multimodal Cues in a Deep Learning-based Framework for Emotion Recognition in the Wild 423
Automatic Image Cropping and Selection using Saliency: an Application to Historical Manuscripts 407
YACCLAB - Yet Another Connected Components Labeling Benchmark 393
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions 381
M-VAD Names: a Dataset for Video Captioning with Naming 376
Layout analysis and content classification in digitized books 374
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model 362
Aligning Text and Document Illustrations: towards Visually Explainable Digital Humanities 354
Connected Components Labeling on DRAGs: Implementation and Reproducibility Notes 354
Visual-Semantic Alignment Across Domains Using a Semi-Supervised Approach 352
Explaining Digital Humanities by Aligning Images and Textual Descriptions 352
Watch Your Strokes: Improving Handwritten Text Recognition with Deformable Convolutions 351
A Deep Multi-Level Network for Saliency Prediction 350
Recognizing social relationships from an egocentric vision perspective 349
Learning to Read L'Infinito: Handwritten Text Recognition with Synthetic Training Data 346
A Hierarchical Quasi-Recurrent approach to Video Captioning 344
A Deep Siamese Network for Scene Detection in Broadcast Videos 342
Optimized Connected Components Labeling with Pixel Prediction 340
Historical Document Digitization through Layout Analysis and Deep Content Classification 340
A Video Library System Using Scene Detection and Automatic Tagging 339
A Browsing and Retrieval System for Broadcast Videos using Scene Detection and Automatic Annotation 338
Hierarchical Boundary-Aware Neural Encoder for Video Captioning 337
Analysis and Re-use of Videos in Educational Digital Libraries with Automatic Scene Detection 334
Shot and Scene Detection via Hierarchical Clustering for Re-using Broadcast Video 331
Image-to-Image Translation to Unfold the Reality of Artworks: an Empirical Analysis 325
Hand Segmentation for Gesture Recognition in EGO-Vision 321
Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation 321
Unveiling the Impact of Image Transformations on Deepfake Detection: An Experimental Analysis 320
SAM: Pushing the Limits of Saliency Prediction Models 320
Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation 316
Context Change Detection for an Ultra-Low Power Low-Resolution Ego-Vision Imager 316
Dual-Branch Collaborative Transformer for Virtual Try-On 315
Measuring scene detection performance 314
The Revolution of Multimodal Large Language Models: A Survey 311
Gesture Recognition using Wearable Vision Sensors to Enhance Visitors' Museum Experiences 311
Recognizing and Presenting the Storytelling Video Structure with Deep Multimodal Networks 308
Multi-Level Net: a Visual Saliency Prediction Model 308
Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs 307
LAMV: Learning to align and match videos with kernelized temporal layers 307
Visual Saliency for Image Captioning in New Multimedia Services 294
Gesture Recognition in Ego-Centric Videos using Dense Trajectories and Hand Segmentation 290
Ai4ar: An ai-based mobile application for the automatic generation of ar contents 287
Towards Cycle-Consistent Models for Text and Image Retrieval 283
From Show to Tell: A Survey on Deep Learning-based Image Captioning 283
Explore and Explain: Self-supervised Navigation and Recounting 281
Scene-driven Retrieval in Edited Videos using Aesthetic and Semantic Deep Features 281
SynthCap: Augmenting Transformers with Synthetic Data for Image Captioning 280
Towards Video Captioning with Naming: a Novel Dataset and a Multi-Modal Approach 279
A Novel Attention-based Aggregation Function to Combine Vision and Language 276
Paying More Attention to Saliency: Image Captioning with Saliency and Context Attention 275
Towards Reliable Experiments on the Performance of Connected Components Labeling Algorithms 261
Meshed-Memory Transformer for Image Captioning 261
Scene segmentation using temporal clustering for accessing and re-using broadcast video 260
Multimodal Attention Networks for Low-Level Vision-and-Language Navigation 255
Tracing Information Flow in LLaMA Vision: A Step Toward Multimodal Understanding 251
Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities 246
CaMEL: Mean Teacher Learning for Image Captioning 240
Retrieval-Augmented Transformer for Image Captioning 240
A Unified Cycle-Consistent Neural Model for Text and Image Retrieval 238
NeuralStory: an Interactive Multimedia System for Video Indexing and Re-use 235
A Computational Approach for Progressive Architecture Shrinkage in Action Recognition 230
A Deep-learning-based approach to VM behavior Identification in Cloud Systems 229
Embodied Agents for Efficient Exploration and Smart Scene Description 219
Revisiting The Evaluation of Class Activation Mapping for Explainability: A Novel Metric and Experimental Analysis 218
Learning to Select: A Fully Attentive Approach for Novel Object Captioning 217
Investigating Bidimensional Downsampling in Vision Transformer Models 217
Boosting Modern and Historical Handwritten Text Recognition with Deformable Convolutions 214
Adapt to Scarcity: Few-Shot Deepfake Detection via Low-Rank Adaptation 212
Hyperbolic Safety-Aware Vision-Language Models 212
Video action detection by learning graph-based spatio-temporal interactions 211
RMS-Net: Regression and Masking for Soccer Event Spotting 210
Focus on Impact: Indoor Exploration with Intrinsic Motivation 208
The Unreasonable Effectiveness of CLIP features for Image Captioning: an Experimental Analysis 207
Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval 204
Intelligent Multimodal Artificial Agents that Talk and Express Emotions 203
Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters 203
BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues 195
Improving Indoor Semantic Segmentation with Boundary-level Objectives 194
Semantically Conditioned Prompts for Visual Recognition under Missing Modality Scenarios 193
Assessing the Role of Boundary-level Objectives in Indoor Semantic Segmentation 189
Matching Faces and Attributes Between the Artistic and the Real Domain: the PersonArt Approach 186
Estimating (and fixing) the Effect of Face Obfuscation in Video Recognition 186
Multimodal Emotion Recognition in Conversation via Possible Speaker's Audio and Visual Sequence Selection 185
ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval 185
With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning 184
Working Memory Connections for LSTM 184
FOSSIL: Free Open-Vocabulary Semantic Segmentation through Synthetic References Retrieval 182
Are Learnable Prompts the Right Way of Prompting? Adapting Vision-and-Language Models with Memory Optimization 180
Towards Explainable Navigation and Recounting 179
Embodied Navigation at the Art Gallery 175
Superpixel Positional Encoding to Improve ViT-based Semantic Segmentation Models 173
Unlearning Vision Transformers without Retaining Data via Low-Rank Decompositions 171
Total 28,828
Category #
all - all 121,336
article - articles 0
book - books 0
conference - conferences 0
curatela - edited volumes 0
other - other 0
patent - patents 0
selected - selected 0
volume - volumes 0
Total 121,336


Period Total Jul Aug Sep Oct Nov Dec Jan Feb Mar Apr May Jun
2020/2021 1,942 0 0 0 0 0 0 235 335 289 562 279 242
2021/2022 3,237 183 139 240 169 84 214 186 230 311 335 825 321
2022/2023 2,767 364 299 246 227 301 283 92 226 386 75 144 124
2023/2024 2,525 243 170 246 312 439 164 120 183 70 201 131 246
2024/2025 8,439 710 242 260 496 1,229 925 512 650 1,081 567 803 964
2025/2026 8,405 1,084 780 1,263 1,528 2,134 1,059 557 0 0 0 0 0
Total 34,135