BARALDI, LORENZO
 Distribuzione geografica
Continente #
NA - Nord America 13.760
EU - Europa 9.459
AS - Asia 8.398
SA - Sud America 839
AF - Africa 108
OC - Oceania 58
Continente sconosciuto - Info sul continente non disponibili 22
Totale 32.644
Nazione #
US - Stati Uniti d'America 13.481
IT - Italia 4.243
SG - Singapore 2.640
CN - Cina 2.379
GB - Regno Unito 1.674
HK - Hong Kong 1.272
DE - Germania 812
BR - Brasile 701
SE - Svezia 631
VN - Vietnam 538
KR - Corea 334
RU - Federazione Russa 326
FR - Francia 289
JP - Giappone 275
FI - Finlandia 255
NL - Olanda 213
ID - Indonesia 201
IN - India 173
CA - Canada 167
UA - Ucraina 158
ES - Italia 148
TR - Turchia 132
AT - Austria 101
TW - Taiwan 100
IE - Irlanda 99
BG - Bulgaria 87
MX - Messico 83
BE - Belgio 66
PL - Polonia 61
CH - Svizzera 56
AR - Argentina 54
BD - Bangladesh 54
AU - Australia 50
RO - Romania 42
ZA - Sudafrica 39
LT - Lituania 37
GR - Grecia 35
MY - Malesia 32
DK - Danimarca 31
PK - Pakistan 30
IL - Israele 27
AE - Emirati Arabi Uniti 26
IQ - Iraq 25
PT - Portogallo 25
EC - Ecuador 19
EU - Europa 19
CL - Cile 18
IR - Iran 18
SA - Arabia Saudita 18
CZ - Repubblica Ceca 17
TH - Thailandia 15
UZ - Uzbekistan 15
DZ - Algeria 14
KZ - Kazakistan 13
CO - Colombia 12
MA - Marocco 11
VE - Venezuela 11
KE - Kenya 10
PE - Perù 10
TN - Tunisia 10
AZ - Azerbaigian 9
PY - Paraguay 9
BZ - Belize 8
EG - Egitto 8
LU - Lussemburgo 8
PH - Filippine 8
JO - Giordania 7
KH - Cambogia 7
NZ - Nuova Zelanda 7
SC - Seychelles 7
SK - Slovacchia (Repubblica Slovacca) 7
HU - Ungheria 6
NO - Norvegia 6
NP - Nepal 6
BH - Bahrain 5
EE - Estonia 5
MO - Macao, regione amministrativa speciale della Cina 5
AL - Albania 4
CY - Cipro 4
HR - Croazia 4
JM - Giamaica 4
KG - Kirghizistan 4
RS - Serbia 4
SN - Senegal 4
SY - Repubblica araba siriana 4
BB - Barbados 3
DO - Repubblica Dominicana 3
GE - Georgia 3
LB - Libano 3
LK - Sri Lanka 3
LV - Lettonia 3
PS - Palestinian Territory 3
CR - Costa Rica 2
ET - Etiopia 2
GT - Guatemala 2
MD - Moldavia 2
MN - Mongolia 2
PA - Panama 2
QA - Qatar 2
UY - Uruguay 2
Totale 32.619
Città #
Singapore 1.635
Fairfield 1.362
Santa Clara 1.322
Ashburn 1.260
Hong Kong 1.063
Southend 958
Hefei 859
Modena 855
Chandler 774
Woodbridge 668
Seattle 608
Houston 592
Cambridge 520
Wilmington 442
Beijing 421
Ann Arbor 384
London 342
Nyköping 342
Los Angeles 282
Milan 257
Dearborn 247
Bologna 245
Jacksonville 235
Seoul 197
Chicago 186
Buffalo 183
Ho Chi Minh City 178
Rome 173
Jakarta 172
Boardman 158
New York 154
Helsinki 139
Reggio Emilia 137
San Diego 134
Tokyo 128
Munich 121
Council Bluffs 118
Parma 109
Hanoi 102
Nuremberg 98
The Dalles 92
Shanghai 90
Princeton 89
Sofia 83
Redwood City 79
Kent 73
Dublin 72
Izmir 72
São Paulo 72
Amsterdam 71
Dong Ket 69
Moscow 61
Phoenix 60
Dallas 59
Eugene 59
Florence 59
Frankfurt am Main 56
San Jose 55
Salt Lake City 51
Bomporto 50
Montreal 50
Bremen 49
Pisa 47
Vienna 47
Formigine 45
Naples 43
Paris 42
Taipei 41
Mexico City 40
Warsaw 40
Piacenza 39
Brussels 37
Zurich 37
Fremont 36
Manchester 35
Orem 35
Ottawa 34
Toronto 33
Chennai 32
Falls Church 32
Lappeenranta 32
Trento 32
Düsseldorf 30
Lauterbourg 29
Nanjing 29
Tampa 29
Falkenstein 28
Portsmouth 28
San Francisco 28
Turin 28
Central 27
Johannesburg 27
Turku 27
Casalgrande 26
Denver 26
Bari 25
Copenhagen 25
Guangzhou 25
Seo-gu 25
Brooklyn 24
Totale 20.876
Nome #
Attentive Models in Vision: Computing Saliency Maps in the Deep Learning Era 490
Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models 446
MissRAG: Addressing the Missing Modality Challenge in Multimodal Large Language Models 437
Artpedia: A New Visual-Semantic Dataset with Visual and Contextual Sentences in the Artistic Domain 429
Spaghetti Labeling: Directed Acyclic Graphs for Block-Based Connected Components Labeling 429
What was Monet seeing while painting? Translating artworks to photo-realistic images 428
Modeling Multimodal Cues in a Deep Learning-based Framework for Emotion Recognition in the Wild 421
Attentive Models in Vision: Computing Saliency Maps in the Deep Learning Era 419
Automatic Image Cropping and Selection using Saliency: an Application to Historical Manuscripts 406
YACCLAB - Yet Another Connected Components Labeling Benchmark 387
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions 375
M-VAD Names: a Dataset for Video Captioning with Naming 374
Layout analysis and content classification in digitized books 370
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model 355
Connected Components Labeling on DRAGs 351
Explaining Digital Humanities by Aligning Images and Textual Descriptions 348
Watch Your Strokes: Improving Handwritten Text Recognition with Deformable Convolutions 348
Aligning Text and Document Illustrations: towards Visually Explainable Digital Humanities 347
Visual-Semantic Alignment Across Domains Using a Semi-Supervised Approach 346
A Deep Multi-Level Network for Saliency Prediction 344
Recognizing social relationships from an egocentric vision perspective 344
A Hierarchical Quasi-Recurrent approach to Video Captioning 338
Learning to Read L'Infinito: Handwritten Text Recognition with Synthetic Training Data 337
Optimized Connected Components Labeling with Pixel Prediction 336
A Browsing and Retrieval System for Broadcast Videos using Scene Detection and Automatic Annotation 336
Historical Document Digitization through Layout Analysis and Deep Content Classification 335
A Deep Siamese Network for Scene Detection in Broadcast Videos 334
A Video Library System Using Scene Detection and Automatic Tagging 333
Hierarchical Boundary-Aware Neural Encoder for Video Captioning 332
Shot and Scene Detection via Hierarchical Clustering for Re-using Broadcast Video 328
Analysis and Re-use of Videos in Educational Digital Libraries with Automatic Scene Detection 328
Image-to-Image Translation to Unfold the Reality of Artworks: an Empirical Analysis 324
Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation 318
Connected Components Labeling on DRAGs: Implementation and Reproducibility Notes 315
Unveiling the Impact of Image Transformations on Deepfake Detection: An Experimental Analysis 314
Hand Segmentation for Gesture Recognition in EGO-Vision 313
Dual-Branch Collaborative Transformer for Virtual Try-On 311
Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation 310
Measuring scene detection performance 310
The Revolution of Multimodal Large Language Models: A Survey 308
SAM: Pushing the Limits of Saliency Prediction Models 308
Context Change Detection for an Ultra-Low Power Low-Resolution Ego-Vision Imager 307
Gesture Recognition using Wearable Vision Sensors to Enhance Visitors' Museum Experiences 306
Recognizing and Presenting the Storytelling Video Structure with Deep Multimodal Networks 303
Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs 302
Multi-Level Net: a Visual Saliency Prediction Model 302
LAMV: Learning to align and match videos with kernelized temporal layers 294
Visual Saliency for Image Captioning in New Multimedia Services 290
Gesture Recognition in Ego-Centric Videos using Dense Trajectories and Hand Segmentation 286
Ai4ar: An ai-based mobile application for the automatic generation of ar contents 283
Towards Cycle-Consistent Models for Text and Image Retrieval 280
From Show to Tell: A Survey on Deep Learning-based Image Captioning 279
SynthCap: Augmenting Transformers with Synthetic Data for Image Captioning 277
Towards Video Captioning with Naming: a Novel Dataset and a Multi-Modal Approach 276
Scene-driven Retrieval in Edited Videos using Aesthetic and Semantic Deep Features 276
Explore and Explain: Self-supervised Navigation and Recounting 273
Paying More Attention to Saliency: Image Captioning with Saliency and Context Attention 272
A Novel Attention-based Aggregation Function to Combine Vision and Language 271
Towards Reliable Experiments on the Performance of Connected Components Labeling Algorithms 259
Meshed-Memory Transformer for Image Captioning 257
Scene segmentation using temporal clustering for accessing and re-using broadcast video 256
Multimodal Attention Networks for Low-Level Vision-and-Language Navigation 248
Tracing Information Flow in LLaMA Vision: A Step Toward Multimodal Understanding 245
Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities 239
Retrieval-Augmented Transformer for Image Captioning 236
CaMEL: Mean Teacher Learning for Image Captioning 233
NeuralStory: an Interactive Multimedia System for Video Indexing and Re-use 230
A Unified Cycle-Consistent Neural Model for Text and Image Retrieval 228
A Deep-learning-based approach to VM behavior Identification in Cloud Systems 222
A Computational Approach for Progressive Architecture Shrinkage in Action Recognition 220
Revisiting The Evaluation of Class Activation Mapping for Explainability: A Novel Metric and Experimental Analysis 215
Boosting Modern and Historical Handwritten Text Recognition with Deformable Convolutions 214
Learning to Select: A Fully Attentive Approach for Novel Object Captioning 214
Investigating Bidimensional Downsampling in Vision Transformer Models 212
Embodied Agents for Efficient Exploration and Smart Scene Description 211
Hyperbolic Safety-Aware Vision-Language Models 210
Adapt to Scarcity: Few-Shot Deepfake Detection via Low-Rank Adaptation 208
Video action detection by learning graph-based spatio-temporal interactions 208
Focus on Impact: Indoor Exploration with Intrinsic Motivation 204
The Unreasonable Effectiveness of CLIP features for Image Captioning: an Experimental Analysis 204
RMS-Net: Regression and Masking for Soccer Event Spotting 203
Intelligent Multimodal Artificial Agents that Talk and Express Emotions 197
Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters 196
Improving Indoor Semantic Segmentation with Boundary-level Objectives 190
Semantically Conditioned Prompts for Visual Recognition under Missing Modality Scenarios 188
BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues 187
Assessing the Role of Boundary-level Objectives in Indoor Semantic Segmentation 184
Working Memory Connections for LSTM 182
With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning 181
Matching Faces and Attributes Between the Artistic and the Real Domain: the PersonArt Approach 181
ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval 179
Multimodal Emotion Recognition in Conversation via Possible Speaker's Audio and Visual Sequence Selection 177
FOSSIL: Free Open-Vocabulary Semantic Segmentation through Synthetic References Retrieval 177
Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval 176
Estimating (and fixing) the Effect of Face Obfuscation in Video Recognition 176
Towards Explainable Navigation and Recounting 174
Are Learnable Prompts the Right Way of Prompting? Adapting Vision-and-Language Models with Memory Optimization 174
Embodied Navigation at the Art Gallery 172
Fashion-Oriented Image Captioning with External Knowledge Retrieval and Fully Attentive Gates 168
Superpixel Positional Encoding to Improve ViT-based Semantic Segmentation Models 168
Totale 28.142
Categoria #
all - tutte 118.965
article - articoli 0
book - libri 0
conference - conferenze 0
curatela - curatele 0
other - altro 0
patent - brevetti 0
selected - selezionate 0
volume - volumi 0
Totale 118.965


Totale Lug Ago Sett Ott Nov Dic Gen Feb Mar Apr Mag Giu
2020/20212.401 0 0 0 0 0 459 235 335 289 562 279 242
2021/20223.237 183 139 240 169 84 214 186 230 311 335 825 321
2022/20232.767 364 299 246 227 301 283 92 226 386 75 144 124
2023/20242.525 243 170 246 312 439 164 120 183 70 201 131 246
2024/20258.439 710 242 260 496 1.229 925 512 650 1.081 567 803 964
2025/20267.531 1.084 780 1.263 1.528 2.134 742 0 0 0 0 0 0
Totale 33.261