Saudi Cultural Missions Theses & Dissertations
Permanent URI for this communityhttps://drepo.sdl.edu.sa/handle/20.500.14154/10
Browse
2 results
Search Results
Item Restricted MFExplain: An Interactive Tool for Explaining Movie Recommendations Generated with Matrix Factorization(University of Zurich, 2023-09-29) Alahmadi, Turki; Bernard, Jürgen; Al Hazwani, IbrahimRecommender systems have become integral in guiding users through the overwhelming abundance of online content. As these systems assume an ever-increasing role in shaping user decisions and preferences, there is a growing demand for clarity in their decision-making processes to instill trust. Recommendation algorithms with a high degree of accuracy such as matrix factorization are highly regarded and widely adopted. Nonetheless, these algorithms tend to exhibit high complexity in their logic and architecture, rendering them challenging to explain to end-users. This issue has been recognized and many tools have presented possible solutions. Many of the implemented approaches, however, have demonstrated shortcomings due to disregarding some user-centered properties or overly concentrating on unraveling the underlying algorithmic intricacy. This work presents MFExplain, an innovative tool for explaining movie recommendations generated with matrix factorization. The tool aims to explain recommendations by relying on the provision of intuitive justifications. Leveraging interactivity and cutting-edge dimensionality reduction techniques enables the tool to also encourage exploration, allow user feedback, and foster many desirable recommender system properties that enrich the user experience.42 0Item Restricted Latent Factorization for Hierarchical and Explainable Embeddings and Data Disaggregation(University of Minnesota Digital Conservancy, 2021) Almutairi, Faisal; Sidiropoulos, NicholasA tremendous growth in data collection has been an important enabler of the recent upsurge in Machine Learning (ML) models. ML techniques involve processing, analyzing, and discovering patterns from real user generated data. These data are usually high-dimensional, sparse, incomplete, and, in many applications, are only available at coarse granularity. For instance, a location mode can be at a state-level rather than county, or a time mode can be on a monthly basis instead of weekly. These (dis)aggregation challenges in real world data raise some intriguing questions and bring some challenging tasks. Given coarse-granular/aggregated data (e.g., monthly summaries), can we recover the fine-granular data (e.g., the daily counts)? Aggregated data enjoy concise representations and thus can be stored and transferred efficiently, which is critical in the era of data deluge. On the other hand, recent ML models are data hungry and benefit from detailed data for personalized analysis and prediction. Thus, data disaggregation algorithms are becoming increasingly important in various domains. In this thesis, we provide data disaggregation frameworks for one-dimensional time series data and multidimensional (tensor) data. The developed models recognize the structure of the data and exploit it to reduce the number of unknown parameters. In a related setting, multidimensional data are often partially observed, e.g., recommender systems data are usually extremely sparse as a user interacts with only a small subset of the available items. Can we reconstruct/complete the missing data? This question is central in many recommendation and more general prediction tasks in various applications such as healthcare, learning and business analytics. A major challenge stems from the fact that the number of unknown parameters is usually much larger than the number of observed samples, which has motivated using prior information. Imposing the appropriate regularization prior limits the solution search to the ‘right’ space. In addition to sparsity, high-dimensionality also creates the challenge of ‘hiding’ the underlying structures and causes that can explain the data. In order to tackle this ‘dimensionality curse’, many dimensionality reduction (DR) methods such as principal component analysis (PCA) have been proposed. The reduced-dimension data usually yields better performance in downstream tasks, such as clustering. This suggests that the underlying structure (e.g., clustering) is more pronounced in some low-dimensional space compared to the original data domain. In this thesis, we present principled approaches that bridge incorporating prior information and DR techniques. We rely on low-rank (nonnegative) matrix factorization for DR and incorporate two different types of priors: i) hierarchical tree clustering, and ii) user-item embedding relationships. Imposing these regularization priors not only improves the quality of latent representations, but it also helps reveal more of the underlying structure in latent space. The tree prior provides a meaningful hierarchical clustering in an unsupervised data-driven fashion, while the user-item relationships underpin the latent factors and explain how the resulting recommendations are formed.4 0