BARALDI, LORENZO
 Distribuzione geografica
Continente #
NA - Nord America 9.256
EU - Europa 5.604
AS - Asia 1.409
SA - Sud America 27
OC - Oceania 26
AF - Africa 23
Continente sconosciuto - Info sul continente non disponibili 21
Totale 16.366
Nazione #
US - Stati Uniti d'America 9.164
IT - Italia 2.568
GB - Regno Unito 1.208
SE - Svezia 478
CN - Cina 442
DE - Germania 442
HK - Hong Kong 364
FR - Francia 171
UA - Ucraina 136
FI - Finlandia 132
TR - Turchia 111
JP - Giappone 108
BG - Bulgaria 81
VN - Vietnam 81
CA - Canada 75
NL - Olanda 74
SG - Singapore 70
KR - Corea 65
IN - India 60
BE - Belgio 46
IE - Irlanda 41
ES - Italia 39
RU - Federazione Russa 33
RO - Romania 32
MY - Malesia 23
AU - Australia 21
DK - Danimarca 21
CH - Svizzera 20
TW - Taiwan 20
EU - Europa 19
GR - Grecia 18
BR - Brasile 17
AT - Austria 16
PL - Polonia 14
IL - Israele 11
DZ - Algeria 10
CZ - Repubblica Ceca 9
ID - Indonesia 9
IR - Iran 9
PT - Portogallo 8
BZ - Belize 7
MX - Messico 7
BD - Bangladesh 6
CL - Cile 6
EG - Egitto 5
KZ - Kazakistan 5
NZ - Nuova Zelanda 5
TN - Tunisia 5
HR - Croazia 4
KH - Cambogia 4
NO - Norvegia 4
TH - Thailandia 4
BH - Bahrain 3
HU - Ungheria 3
PE - Perù 3
PH - Filippine 3
SA - Arabia Saudita 3
BB - Barbados 2
CY - Cipro 2
LU - Lussemburgo 2
PK - Pakistan 2
RS - Serbia 2
A1 - Anonimo 1
A2 - ???statistics.table.value.countryCode.A2??? 1
AE - Emirati Arabi Uniti 1
AR - Argentina 1
BA - Bosnia-Erzegovina 1
EE - Estonia 1
ET - Etiopia 1
GL - Groenlandia 1
IQ - Iraq 1
MN - Mongolia 1
MO - Macao, regione amministrativa speciale della Cina 1
SC - Seychelles 1
ZA - Sudafrica 1
Totale 16.366
Città #
Fairfield 1.360
Southend 958
Ashburn 792
Chandler 774
Woodbridge 668
Modena 659
Seattle 575
Houston 571
Cambridge 517
Wilmington 435
Ann Arbor 380
Nyköping 342
Hong Kong 297
Dearborn 247
Jacksonville 233
Beijing 215
San Diego 134
Boardman 122
Bologna 113
Milan 104
Princeton 89
Helsinki 84
Redwood City 79
Sofia 78
Izmir 72
Buffalo 69
Dong Ket 69
London 61
New York 60
Eugene 59
Parma 59
Rome 53
Reggio Emilia 52
Bomporto 50
Bremen 49
Tokyo 40
Phoenix 35
Falls Church 32
Munich 31
Ottawa 30
Pisa 30
San Jose 29
Brussels 28
Florence 28
Central 27
Fremont 27
Nanjing 27
Formigine 26
Paris 24
Amsterdam 23
Varese 22
Prata Di Pordenone 19
Shanghai 19
Copenhagen 17
Dublin 17
Frankfurt am Main 17
Piacenza 17
Singapore 17
Toronto 17
Bari 16
Los Angeles 16
Trento 15
Vigevano 15
Ghedi 14
Ponte San Pietro 14
Utrecht 14
Kraków 13
Turin 13
Zurich 13
Castelnuovo Rangone 12
Guangzhou 12
Norwalk 12
A Coruña 11
Chiswick 11
Venezia 11
Verona 11
Carpi 10
Chicago 10
Dallas 10
Hanoi 10
Menlo Park 10
Scandiano 10
Seoul 10
Naples 9
San Francisco 9
Vienna 9
Berkeley 8
Grafing 8
Livorno 8
Padova 8
Porto Mantovano 8
Reggio Nell'emilia 8
Romainville 8
Santa Clara 8
Taipei 8
Washington 8
Agrigento 7
Apo 7
Belize City 7
Catania 7
Totale 11.506
Nome #
Attentive Models in Vision: Computing Saliency Maps in the Deep Learning Era 343
Artpedia: A New Visual-Semantic Dataset with Visual and Contextual Sentences in the Artistic Domain 326
What was Monet seeing while painting? Translating artworks to photo-realistic images 321
Modeling Multimodal Cues in a Deep Learning-based Framework for Emotion Recognition in the Wild 306
Automatic Image Cropping and Selection using Saliency: an Application to Historical Manuscripts 296
Spaghetti Labeling: Directed Acyclic Graphs for Block-Based Connected Components Labeling 296
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions 288
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model 281
Layout analysis and content classification in digitized books 274
Watch Your Strokes: Improving Handwritten Text Recognition with Deformable Convolutions 268
YACCLAB - Yet Another Connected Components Labeling Benchmark 266
Recognizing social relationships from an egocentric vision perspective 266
M-VAD Names: a Dataset for Video Captioning with Naming 264
Attentive Models in Vision: Computing Saliency Maps in the Deep Learning Era 259
Explaining Digital Humanities by Aligning Images and Textual Descriptions 259
Hierarchical Boundary-Aware Neural Encoder for Video Captioning 258
Visual-Semantic Alignment Across Domains Using a Semi-Supervised Approach 256
Historical Document Digitization through Layout Analysis and Deep Content Classification 247
Connected Components Labeling on DRAGs 247
Optimized Connected Components Labeling with Pixel Prediction 246
Image-to-Image Translation to Unfold the Reality of Artworks: an Empirical Analysis 246
Multi-Level Net: a Visual Saliency Prediction Model 241
LAMV: Learning to align and match videos with kernelized temporal layers 236
Recognizing and Presenting the Storytelling Video Structure with Deep Multimodal Networks 235
A Deep Multi-Level Network for Saliency Prediction 233
Connected Components Labeling on DRAGs: Implementation and Reproducibility Notes 232
Analysis and Re-use of Videos in Educational Digital Libraries with Automatic Scene Detection 231
A Video Library System Using Scene Detection and Automatic Tagging 229
Shot and Scene Detection via Hierarchical Clustering for Re-using Broadcast Video 227
Aligning Text and Document Illustrations: towards Visually Explainable Digital Humanities 227
Gesture Recognition using Wearable Vision Sensors to Enhance Visitors' Museum Experiences 225
SAM: Pushing the Limits of Saliency Prediction Models 225
Hand Segmentation for Gesture Recognition in EGO-Vision 224
Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation 224
Visual Saliency for Image Captioning in New Multimedia Services 222
A Browsing and Retrieval System for Broadcast Videos using Scene Detection and Automatic Annotation 222
Measuring scene detection performance 220
Context Change Detection for an Ultra-Low Power Low-Resolution Ego-Vision Imager 218
Towards Cycle-Consistent Models for Text and Image Retrieval 212
Gesture Recognition in Ego-Centric Videos using Dense Trajectories and Hand Segmentation 207
A Deep Siamese Network for Scene Detection in Broadcast Videos 206
A Hierarchical Quasi-Recurrent approach to Video Captioning 206
Scene-driven Retrieval in Edited Videos using Aesthetic and Semantic Deep Features 204
Towards Video Captioning with Naming: a Novel Dataset and a Multi-Modal Approach 202
Paying More Attention to Saliency: Image Captioning with Saliency and Context Attention 194
Dual-Branch Collaborative Transformer for Virtual Try-On 188
Explore and Explain: Self-supervised Navigation and Recounting 181
Learning to Read L'Infinito: Handwritten Text Recognition with Synthetic Training Data 178
Scene segmentation using temporal clustering for accessing and re-using broadcast video 176
From Show to Tell: A Survey on Deep Learning-based Image Captioning 172
Positive-Augmented Constrastive Learning for Image and Video Captioning Evaluation 163
Towards Reliable Experiments on the Performance of Connected Components Labeling Algorithms 161
NeuralStory: an Interactive Multimedia System for Video Indexing and Re-use 157
Retrieval-Augmented Transformer for Image Captioning 154
SynthCap: Augmenting Transformers with Synthetic Data for Image Captioning 153
Meshed-Memory Transformer for Image Captioning 145
A Unified Cycle-Consistent Neural Model for Text and Image Retrieval 142
Ai4ar: An ai-based mobile application for the automatic generation of ar contents 141
Revisiting The Evaluation of Class Activation Mapping for Explainability: A Novel Metric and Experimental Analysis 141
Multimodal Attention Networks for Low-Level Vision-and-Language Navigation 141
A Novel Attention-based Aggregation Function to Combine Vision and Language 136
RMS-Net: Regression and Masking for Soccer Event Spotting 135
Video action detection by learning graph-based spatio-temporal interactions 132
CaMEL: Mean Teacher Learning for Image Captioning 129
Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters 124
Learning to Select: A Fully Attentive Approach for Novel Object Captioning 124
Focus on Impact: Indoor Exploration with Intrinsic Motivation 120
A Deep-learning-based approach to VM behavior Identification in Cloud Systems 113
SMArT: Training Shallow Memory-aware Transformers for Robotic Explainability 111
Improving Indoor Semantic Segmentation with Boundary-level Objectives 110
Investigating Bidimensional Downsampling in Vision Transformer Models 109
Unveiling the Impact of Image Transformations on Deepfake Detection: An Experimental Analysis 106
Boosting Modern and Historical Handwritten Text Recognition with Deformable Convolutions 102
The Unreasonable Effectiveness of CLIP features for Image Captioning: an Experimental Analysis 101
A Computational Approach for Progressive Architecture Shrinkage in Action Recognition 100
Assessing the Role of Boundary-level Objectives in Indoor Semantic Segmentation 97
Working Memory Connections for LSTM 93
Explaining Transformer-based Image Captioning Models: An Empirical Analysis 91
Estimating (and fixing) the Effect of Face Obfuscation in Video Recognition 88
Matching Faces and Attributes Between the Artistic and the Real Domain: the PersonArt Approach 87
Embodied Navigation at the Art Gallery 82
Shot, scene and keyframe ordering for interactive video re-use 81
ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval 75
Spot the Difference: A Novel Task for Embodied Agents in Changing Environments 72
Fashion-Oriented Image Captioning with External Knowledge Retrieval and Fully Attentive Gates 59
FOSSIL: Free Open-Vocabulary Semantic Segmentation through Synthetic References Retrieval 56
Out of the Box: Embodied Navigation in the Real World 56
The LAM Dataset: A Novel Benchmark for Line-Level Handwritten Text Recognition 56
What’s Outside the Intersection? Fine-grained Error Analysis for Semantic Segmentation Beyond IoU 55
Embodied Agents for Efficient Exploration and Smart Scene Description 54
With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning 54
Superpixel Positional Encoding to Improve ViT-based Semantic Segmentation Models 53
Enhancing Open-Vocabulary Semantic Segmentation with Prototype Retrieval 53
Towards Explainable Navigation and Recounting 47
Preface 47
Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation 42
Are Learnable Prompts the Right Way of Prompting? Adapting Vision-and-Language Models with Memory Optimization 34
Generating More Pertinent Captions by Leveraging Semantics and Style on Multi-Source Datasets 33
Video Surveillance and Privacy: A Solvable Paradox? 32
Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs 32
Totale 16.789
Categoria #
all - tutte 61.216
article - articoli 0
book - libri 0
conference - conferenze 0
curatela - curatele 0
other - altro 0
patent - brevetti 0
selected - selezionate 0
volume - volumi 0
Totale 61.216


Totale Lug Ago Sett Ott Nov Dic Gen Feb Mar Apr Mag Giu
2018/2019754 0 0 0 0 0 0 0 0 0 125 352 277
2019/20202.898 292 151 80 208 364 434 470 259 242 124 151 123
2020/20213.458 260 81 218 269 229 459 235 335 289 562 279 242
2021/20223.237 183 139 240 169 84 214 186 230 311 335 825 321
2022/20232.767 364 299 246 227 301 283 92 226 386 75 144 124
2023/20242.092 243 170 246 312 439 160 118 183 70 151 0 0
Totale 16.858