Commit Graph

254 Commits

Author SHA1 Message Date
Han Yin 3370bd409c remote: add HuggingFaceModelDetails data class 2025-10-28 11:39:18 -07:00
Han Yin 4ff924b273 lib: optimize engine loader; always perform a fresh detection when cache is null 2025-10-28 11:39:17 -07:00
Han Yin e6413dd05d UI: support `NONE` Llama Tier in general settings 2025-10-28 11:39:17 -07:00
Han Yin 1f41ae2315 lib: refactored InferenceEngineLoader; added a `NONE` Llama Tier 2025-10-28 11:39:17 -07:00
Han Yin 8c6e449ad2 misc: UI polish 2025-10-28 11:39:17 -07:00
Han Yin d211c4c605 UI: add visualizer UI for Arm features 2025-10-28 11:39:17 -07:00
Han Yin 70ec1f7de7 di: add a stub TierDetection; provide both actual impl and stub in AppModule 2025-10-28 11:39:17 -07:00
Han Yin 21e61281fa lib: expose Arm features 2025-10-28 11:39:17 -07:00
Han Yin c5058366dc lib: hide the internal implementations, only expose a facade and interfaces 2025-10-28 11:39:17 -07:00
Han Yin 57c3a9dda7 lib: replace the naive & plain SharedPreferences with DataStore implementation 2025-10-28 11:39:17 -07:00
Han Yin 130cba9aa6 lib: expose GgufMetadataReader as interface only 2025-10-28 11:39:17 -07:00
Han Yin 6a5bc94ff1 [WIP] lib: move GgufMetadata into the lib submodule 2025-10-28 11:39:17 -07:00
Han Yin 4b3f6ef8d7 misc: rename LlamaAndroid related class to InferenceEngine prefixes 2025-10-28 11:39:17 -07:00
Han Yin 72822f0236 misc: reorg the pkg structure 2025-10-28 11:39:17 -07:00
Han Yin b59c59e5c3 core: add back OpenMP due to huge perf loss on TG128 2025-10-28 11:39:17 -07:00
Han Yin 53ac8af67a core: swap out hardcoded LlamaAndroid library loading 2025-10-28 11:39:17 -07:00
Han Yin 1b79db877d core: implement cpu_detector native lib 2025-10-28 11:39:17 -07:00
Han Yin 98c8f5e59e [WIP] llama: enable KleidiAI and disable tier 4 due to `+sve+sve2` bug caused by `ggml_add_cpu_backend_variant_impl` as explained below
```CMake
if (NOT SME_ENABLED MATCHES -1)
...
    set(PRIVATE_ARCH_FLAGS "-fno-tree-vectorize;${PRIVATE_ARCH_FLAGS}+sve+sve2")
...
```
2025-10-28 11:39:17 -07:00
Han Yin ead41ff655 [WIP] llama: disable OpenMP in ABI split since most SoCs are big.LITTLE 2025-10-28 11:39:17 -07:00
Han Yin 3884bbcb86 [WIP] llama: ABI split where five tiers are built sequentially. 2025-10-28 11:39:17 -07:00
Han Yin 75d1abe24a [WIP] llama: ABI split builds five .so artifacts.
However, all .so are performing on SVE level
2025-10-28 11:39:17 -07:00
Han Yin eab502a735 llama: migrate C/CXX flags into CMakeList 2025-10-28 11:39:17 -07:00
Han Yin 512fe1be4e UI: disable landscape mode; prevent duplicated benchmark running 2025-10-28 11:39:17 -07:00
Han Yin a4c66c4baf nit: print current pp & tg in llama-bench 2025-10-28 11:39:17 -07:00
Han Yin bbf0420bac ui: add "clear results" action to Benchmark screen 2025-10-28 11:39:17 -07:00
Han Yin 49df3aea9a UI: map language codes into country Emojis 2025-10-28 11:39:17 -07:00
Han Yin 0c7e1fc7a2 [WIP] UI: replace the HuggingFace stub in Model Management screen with actual API call 2025-10-28 11:39:17 -07:00
Han Yin cfbd271c84 data: update Model data repository to support fetching HuggingFace models 2025-10-28 11:39:17 -07:00
Han Yin 48fa0b23dc data: implement HuggingFace data model, data source with Retrofit API 2025-10-28 11:39:17 -07:00
Han Yin 4b2f769ba8 deps: introduce Retrofit2 2025-10-28 11:39:17 -07:00
Han Yin fe6eafe154 data: update & enhance system prompt presets 2025-10-28 11:39:17 -07:00
Han Yin 81ad468c78 UI: allow hide or show model card on Conversation & Benchmark screens; fix message arrangement 2025-10-28 11:39:17 -07:00
Han Yin 43d9d300aa data: update system prompt presets 2025-10-28 11:39:17 -07:00
Han Yin 27edf0775d UI: polish Conversation screen 2025-10-28 11:39:17 -07:00
Han Yin 379be5d6c3 pkg: restructure system metrics into a separate file 2025-10-28 11:39:17 -07:00
Han Yin b1831c4053 pkg: restructure TopBarApps into separate files in a child package 2025-10-28 11:39:17 -07:00
Han Yin 3c539dc146 pkg: restructure BottomAppBars into separate files in a child package 2025-10-28 11:39:17 -07:00
Han Yin 0bcb182d17 feature: implement Conversation screen's bottom app bar 2025-10-28 11:39:17 -07:00
Han Yin d3011d48e6 nit: disable mode selection's radio buttons when loading model 2025-10-28 11:39:17 -07:00
Han Yin 0dec7fb2a3 ui: polish Benchmark screen; implement its bottom app bar's rerun and share 2025-10-28 11:39:17 -07:00
Han Yin 32f37a4316 [WIP] ui: polish Benchmark screen; implement its bottom app bar 2025-10-28 11:39:17 -07:00
Han Yin ec907d01ba ui: add filter off icon to "Clear filters" menu item 2025-10-28 11:39:17 -07:00
Han Yin 9e4ba94d66 ui: show empty models info in Model Management screen 2025-10-28 11:39:17 -07:00
Han Yin dd0367b970 feature: support filtering in Model Management screen 2025-10-28 11:39:17 -07:00
Han Yin d97e28a6d8 UI: animate FAB on model preselection states 2025-10-28 11:39:17 -07:00
Han Yin 05c620cc52 data: move Model related actions (query, filter, sort) into ModelInfo file 2025-10-28 11:39:17 -07:00
Han Yin ef3791207b UI: extract InfoView out into a separate file for reusability 2025-10-28 11:39:17 -07:00
Han Yin 73330596ae nit: move scaffold related UI components into a separate package 2025-10-28 11:39:17 -07:00
Han Yin 77edad5a01 feature: support searching on Model Selection screen 2025-10-28 11:39:17 -07:00
Han Yin 2b3ba770dd UI: finally support theme modes; remove hardcoded color schemes, default to dynamic color scheme implementation 2025-10-28 11:39:17 -07:00
Han Yin a8dc825aef UI: handle back press on Model Selection screen 2025-10-28 11:39:17 -07:00
Han Yin d1b018e375 UI: show a Snack bar to warn user that system prompt is not always supported 2025-10-28 11:39:17 -07:00
Han Yin 56a7272858 UI: polish model cards on Benchmark and Conversation screens to show model loading metrics 2025-10-28 11:39:17 -07:00
Han Yin 10ca2fa834 util: extract formatting helper functions from FileUtils into a new FormatUtils 2025-10-28 11:39:17 -07:00
Han Yin d7afcc41d5 UI: polish ModelLoading screen 2025-10-28 11:39:17 -07:00
Han Yin 57b5001f5c nit: revert accidental committing of debug code 2025-10-28 11:39:17 -07:00
Han Yin ec47fa3d14 nit: allow deselect model on Model Selection screen 2025-10-28 11:39:17 -07:00
Han Yin 6b74c49e6b UI: polish model card 2025-10-28 11:39:17 -07:00
Han Yin c12ef7a779 UI: update ModelSelectionScreen with a preselect mechanism 2025-10-28 11:39:17 -07:00
Han Yin b81a0c6e6d UI: refactor ModelCard UI to show GGUF metadata 2025-10-28 11:39:17 -07:00
Han Yin 9056f27a91 nit: rename lastUsed field to dateLastUsed; add dateAdded field 2025-10-28 11:39:17 -07:00
Han Yin 7540c2a8b9 nit: refactor data.local package structure 2025-10-28 11:39:17 -07:00
Han Yin 7ed79319e5 GGUF: make GgufMetadata serializable in order to be compatible with Room 2025-10-28 11:39:17 -07:00
Han Yin 8ae0c3d2fa DB: introduce Kotlin serialization extension's library and plugin; add Room runtime library 2025-10-28 11:39:17 -07:00
Han Yin 67499727ef gguf: add GGUF metadata data holder and its corresponding extractor implementation 2025-10-28 11:39:17 -07:00
Han Yin a9466c0370 navigation: sink model loading state management from AppContent down into ModelLoadingScreen; pass ModelLoadingMetrics to Benchmark and Conversation screens 2025-10-28 11:39:17 -07:00
Han Yin 8a682ff85d core: throw Exception instead of returning null if model fails to load 2025-10-28 11:39:17 -07:00
Han Yin f313362ced nit: polish ModelLoadingScreen UI 2025-10-28 11:39:17 -07:00
Han Yin 1d508f367e UI: update AppContent to pass in correct navigation callbacks 2025-10-28 11:39:17 -07:00
Han Yin 0d65c4b06b nit: extract app name into a constant value; remove unused onBackPressed callbacks 2025-10-28 11:39:17 -07:00
Han Yin 9f1d26ac95 UI: migrate ConversationViewModel onto ModelLoadingViewModel; update & refine ConversationScreen 2025-10-28 11:39:17 -07:00
Han Yin cb508be782 UI: migrate ModelLoadingScreen onto ModelLoadingViewModel; update & refine ModelLoadingScreen 2025-10-28 11:39:17 -07:00
Han Yin f61c512223 UI: expose a single facade ModelUnloadDialogHandler; move UnloadModelState into ModelUnloadingViewModel.kt 2025-10-28 11:39:17 -07:00
Han Yin c5a3ac7eb1 UI: Introduce an abstract ViewModel to handle additional model unloading logics 2025-10-28 11:39:17 -07:00
Han Yin e1c77c6bbd LLama: add a new Initializing state; ; add two extension properties; rename LibraryLoaded state to Initialized 2025-10-28 11:39:17 -07:00
Han Yin ba40d689a1 UI: implement BenchmarkScreen's individual back handling 2025-10-28 11:39:17 -07:00
Han Yin 8203ddb97a UI: refactor back handling by removing centralized BackHandlerSetup and UnloadModelConfirmationDialog from AppContent 2025-10-28 11:39:17 -07:00
Han Yin c08d02d233 LLama: add ModelUnloadingState to engine State; add missing state checks in stub engine; fix instrumentation engine's error messages 2025-10-28 11:39:17 -07:00
Han Yin 481ba6e9d3 UI: remove code duplication in sort menu 2025-10-28 11:39:17 -07:00
Han Yin 41615be5ae UI: fix the typo `totalGb` in `StorageMetrics` 2025-10-28 11:39:17 -07:00
Han Yin 69f2bd62f9 UI: replace ugly optional as casts in AppScaffold with extension functions 2025-10-28 11:39:17 -07:00
Han Yin e269da655f UI: combine TopBarConfig and BottomBarConfig into each route's ScaffoldConfig 2025-10-28 11:39:17 -07:00
Han Yin 225c5435c5 UI: refactor BottomBarConfig.ModelsManagement APIs 2025-10-28 11:39:17 -07:00
Han Yin 63fc56d603 UI: centralize the AppScaffold and modularize its configs 2025-10-28 11:39:17 -07:00
Han Yin 72e97b93c5 feature: check for available space before copying local model 2025-10-28 11:39:16 -07:00
Han Yin 65d4a57a8b LLama: refactor loadModel by splitting the system prompt setting into a separate method 2025-10-28 11:39:16 -07:00
Han Yin 9f77155535 VM: handle the cancellation of ongoing token generation 2025-10-28 11:39:16 -07:00
Han Yin 46859c10f0 LLama: update engine state after handling the cancellation of sendUserPrompt 2025-10-28 11:39:16 -07:00
Han Yin 06448a60a5 UI: update UI ongoing model import's cancellation 2025-10-28 11:39:16 -07:00
Han Yin 9ba74a9d3d data: allow canceling the ongoing model import 2025-10-28 11:39:16 -07:00
Han Yin d70b8fe323 core: swap in LLamaAndroid and mark stub engine for testing only 2025-10-28 11:39:16 -07:00
Han Yin c2426a42e5 UI: unify Model Card components 2025-10-28 11:39:16 -07:00
Han Yin 434933f5b3 UI: show model card in Conversation screen 2025-10-28 11:39:16 -07:00
Han Yin 9769467723 UI: show model card in Model Loading screen 2025-10-28 11:39:16 -07:00
Han Yin 9cfa74f754 core: break down InferenceManager due to Interface Segregation Principle 2025-10-28 11:39:16 -07:00
Han Yin 286ed05f13 vm: merge SystemPromptViewModel into ModelLoadingViewModel 2025-10-28 11:39:16 -07:00
Han Yin 23d411d86e vm: split mono MainViewModel into separate individual ViewModels 2025-10-28 11:39:16 -07:00
Han Yin 32d778bb8e core: extract conversation and benchmark logics into InferenceManager; add logs and missing state updates in stub InferenceEngine 2025-10-28 11:39:16 -07:00
Han Yin 51b120f464 data: pass through getModelById from ModelDao into ModelRepository 2025-10-28 11:39:16 -07:00
Han Yin 59f5caa699 Util: split FileUtils from ModelUtils; extract copy methods into FileUtils 2025-10-28 11:39:16 -07:00