What Is Gemini and What Are Its Top Features?

Google’s Gemini is the next-generation AI digital assistant, designed to replace the classic Google Assistant. Available as both an integrated feature in Android and a standalone chatbot app, Gemini is powered by Google’s large language models (LLMs), making it more intelligent and capable than its predecessor.

With its multimodal capabilities, Gemini can process text, images, and voice inputs, enabling natural and conversational interactions. It also introduces new features that were not possible with the old Google Assistant.

We’ve compiled a list of Gemini features, covering both free and paid versions. Most of these AI tools are available to try with certain usage limits, allowing you to decide whether to upgrade. Let’s dive into Gemini’s top features.

1. Multimodal Gemini

One of Gemini’s biggest advantages is its multimodal capabilities. While Google Assistant supports voice, text, and image inputs, Gemini takes it further by allowing users to interact with videos, code, and even files. The smarter AI chatbot can also engage in conversational interactions and output images and videos, in addition to text.

Gemini supports various input types, making it a versatile AI assistant:

  • Text
  • Files or Documents
  • Images
  • Video
  • Audio
  • Code

Additionally, Gemini Advanced can process larger and multiple files, such as entire project databases or thousands of pages of documents.

2. Generate Images

One of the most useful features of Gemini is text-to-image generation, available in both the basic and premium versions. The tool is powered by Google’s Imagen 4 model, which produces lifelike, higher-quality images, understands more complex instructions, and offers different styles. However, the free plan has limitations: it doesn’t include all effects and processes images more slowly.

How to Generate Images in Gemini

  1. Open the Gemini app on your device.
  2. Enter your prompt, starting with “create,” “generate,” or “draw.”
  3. Specify details such as the style (realistic or animated), colors, and elements.
  4. Tap the send button and wait for the result.
  5. Refine your prompt for better results and save or share the image.

Image: Type or speak your prompt, starting with a keyword such as “create” or “generate.” (Source: nextpit)
Image: You can adjust the image by refining the prompt with descriptions and styles.

For a more detailed guide, check out our tutorial on how to create and save AI images with Gemini.
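
If you reuse the same kind of image prompt often, the structure described in steps 2 and 3 can be captured in a small helper. The following is only an illustrative sketch: `build_image_prompt` and its parameters are hypothetical names, not part of Gemini, and the resulting text is simply something you would paste or speak into the app.

```python
def build_image_prompt(subject, style=None, colors=None, elements=None, verb="create"):
    """Assemble an image prompt: a leading keyword, then optional details."""
    # The three keywords come from step 2 above; everything else is an assumption.
    allowed_verbs = ("create", "generate", "draw")
    if verb not in allowed_verbs:
        raise ValueError(f"verb must be one of {allowed_verbs}")

    parts = [f"{verb.capitalize()} an image of {subject}"]
    if style:
        parts.append(f"in a {style} style")
    if colors:
        parts.append("using " + " and ".join(colors) + " colors")
    if elements:
        parts.append("with " + ", ".join(elements))
    return " ".join(parts) + "."


prompt = build_image_prompt(
    "a wizard playing a broomstick sport",
    style="realistic",
    colors=["red", "gold"],
    elements=["a stadium", "a cloudy sky"],
)
print(prompt)
```

Keeping the keyword first and the details optional mirrors the steps above: you can start with a short prompt and add style, colors, or elements when refining the result.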

3. Summarize Texts and Videos

Gemini can summarize long texts, emails, documents, and YouTube videos. To enable this feature, you need to activate the Google Workspace and YouTube extensions within the app.

How to summarize texts in Gemini

  1. Open the Gemini app.
  2. Paste or type the text or URL into the prompt box and ask Gemini to summarize it.
  3. Hit the send button and receive the summary.

Image: You can paste web URLs, text, documents, and more into the prompt and let Gemini summarize them.
Image: You can copy the summarized text or export it to Google Docs.

How to summarize YouTube videos (with subtitles enabled)

  1. Open the YouTube video.
  2. Launch or summon Gemini.
  3. Type or speak, “Summarize this video.”

Image: You can summarize YouTube videos using the Ask about video chip in Gemini.
Image: Alternatively, you can ask questions or extract information from YouTube videos using Ask about video.

You can edit the summary or request a different format, such as bullet points or a paragraph. Additionally, Gemini allows you to export the text to Docs or Gmail. For a deeper dive, check out our tutorial on how to use Gemini to summarize YouTube videos.

4. Ask About What’s On Your Screen

One of Gemini’s most useful features is contextual awareness via “Ask About Screen.” This allows you to get explanations and insights about images and web pages—offering a Google Lens alternative with added AI capabilities.

How to Use Ask About Screen

  1. Open an image or web page (or take a new photo).
  2. Launch Gemini via gesture or shortcut.
  3. Tap Ask about screen (a screenshot will be taken and uploaded).
  4. Type or speak your query about the image or content.
  5. Hit send.

Image: Go to a photo or web page to use Gemini’s Ask about screen.
Image: Ask questions about the image.
Image: Ask about screen results appear as text that you can copy or share.
Image: You can also extract and translate text using Gemini’s Ask about screen.

You can also upload a photo manually to the Gemini app for analysis or extract text from web pages to summarize, explain, or translate.

5. Ask About PDF Files

The “Ask About PDF” feature works similarly to “Ask About Screen” but focuses on PDF documents. You can summarize, extract text, or search for specific data within a file. This feature is ideal for quickly extracting key information from lengthy documents.

While this feature is available in both the basic and premium Gemini versions, the paid version can handle larger and more complex PDF files using a more powerful model.

How to Use Ask About PDF

  1. Launch the Google Files app.
  2. Open a PDF document.
  3. Launch Gemini via gesture and then tap Ask about PDF.
  4. Wait until the file is uploaded.
  5. Enter your request and then tap send.

Image: Tap the Ask about this PDF button.
Image: Wait for the PDF file to upload before making a query.
Image: Type or speak your question about the PDF file.
Image: You can take further actions, such as copying or sharing the results.

6. Use Gemini with Gmail and Calendar

Gemini integrates seamlessly with Google apps, making task management easier. For example, it can extract flight details from Gmail and add them as events in Google Calendar.

How to Use Gemini for Calendar Events

  1. Ask Gemini to pull flight details or schedules from Gmail.
  2. Add or modify events directly in Google Calendar by tapping Continue in Calendar.

Image: Enter a prompt asking Gemini to pull details from Gmail and add them to your calendar. This works with flight schedules, reservations, meetings, and more.
Image: Tap Continue to Calendar to manage the schedules.

Gemini can also detect dates within emails, allowing you to quickly add them to your calendar. This capability is currently available on the web.

7. Analyze and Create Visualizations (tested on paid version)

Gemini can analyze data from spreadsheets, slides, and documents and turn it into an easy-to-digest presentation. It can also surface the most interesting insights and patterns as images or as interactive charts via Canvas.

How to create visual graphs and charts using Gemini

  1. Launch Gemini.
  2. Tap on the + button and select Files.
  3. Upload your spreadsheet (Excel, Sheets, or CSV).
  4. Start your prompt with “visualize” or “create charts or bar graphs from the data.”

Image: Upload files such as spreadsheets, PDFs, or CSVs to Gemini.
Image: In the prompt, ask Gemini to visualize the data, trends, or patterns in the file.
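
To get a feel for what a request like “create bar graphs from the data” involves, here is a rough sketch of the underlying idea: group the rows of a CSV file, average a numeric column, and draw one bar per group. It does not show how Gemini works internally. The sample rows are made up for illustration, and `average_by` and `text_bar_chart` are hypothetical helper names.

```python
import csv
import io
from collections import defaultdict

# Made-up sample rows standing in for an uploaded spreadsheet (not real price data).
SAMPLE_CSV = """fruit,form,price
Apples,Fresh,0.40
Apples,Juice,0.60
Berries,Fresh,1.00
Berries,Frozen,0.80
Peaches,Canned,0.70
Peaches,Fresh,0.60
"""


def average_by(rows, group_key, value_key):
    """Return the mean of value_key for each distinct value of group_key."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for row in rows:
        totals[row[group_key]] += float(row[value_key])
        counts[row[group_key]] += 1
    return {group: totals[group] / counts[group] for group in totals}


def text_bar_chart(averages, scale=20):
    """Render one line per group, sorted by name, with a bar of '#' characters."""
    lines = []
    for group, value in sorted(averages.items()):
        bar = "#" * round(value * scale)
        lines.append(f"{group:<8}{bar} {value:.2f}")
    return "\n".join(lines)


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
averages = average_by(rows, "form", "price")
print(text_bar_chart(averages))
```

Knowing which column you want grouped and which one averaged also makes for a more precise prompt, for example “visualize the average price by form as a bar chart.”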

8. Get Code for Interactive and Collaborative Visuals via Canvas

With the Canvas feature in Gemini, you can create richer, interactive content such as infographics, quizzes, and apps, either as a document or as code. It’s available in both the free and paid Gemini apps. Paid users get access to the latest and most capable Gemini model, which offers better performance and handles larger projects.

Here’s how to use Canvas:

  1. Launch Gemini.
  2. Tap the “more” button (often a “+” icon) in the prompt bar and select Canvas.
  3. Type in your prompt to describe the document, code, or interactive content you want to generate.
    1. You can also upload supported files (like PDFs or images) to inform your prompt.
  4. Wait for the content to be generated.

Image: Select Canvas before entering your prompt when generating code for interactive and collaborative charts and graphs.
Image: You can export the generated documents and test or edit the code.

For document types, you can view or download the file. For code, you can preview it directly within Canvas on both web and mobile platforms, and there are options to share and edit the code.

9. Use Gemini on the Lock Screen

Gemini carries over Google Assistant’s lock screen capabilities, such as setting an alarm or calendar event, reading notifications, and answering calls, so you can perform tasks without unlocking your device.

How to Enable Gemini on the Lock Screen

  1. Open the Gemini app.
  2. Tap on your profile picture.
  3. Go to Settings > Gemini on Lock Screen.
  4. Enable Use Gemini without unlocking.
  5. Toggle Make calls and send messages without unlocking.

Image: Go to Gemini’s Settings > Gemini on Lock screen.
Image: Enable the two toggles to use Gemini on the lock screen.

10. Control Smart Home with Gemini

Like Google Assistant, Gemini can control smart home devices via the Google Home app. You can issue natural language commands, such as:

  • “It’s hot in the living room, turn on the AC.”
  • “Dim the bedroom lights to 50%.”

How to Enable Smart Home Control

  1. Open the Gemini app.
  2. Tap your profile picture > Apps.
  3. Scroll down and enable Device Control.
  4. Select Google Home and Utilities.

Image: Go to Gemini’s menu and then tap Apps.
Image: Enable the smart home apps in Gemini.

11. Chat and Share Your Camera with Gemini Live

Gemini Live offers natural, continuous conversations. The free version has almost the same capabilities as the premium one, though the latter unlocks deeper interactions, including conversation transcripts for later reference. Follow the steps below to use Gemini Live.

  1. Launch Gemini.
  2. Tap on the Gemini Live button.
  3. Start your conversation by asking a question or offering a greeting.

Alternatively, you can expand the conversation by sharing your screen or camera view with Gemini Live. In short, this is a live version of Ask about screen, in which you can base your queries and conversation on the visuals shown on the screen. To activate it, simply press the camera or screen sharing button while talking to Gemini Live.

For a more detailed look, check out our colleague Antoine Engels’ review of Gemini Live: Talking to Google’s AI is fun, weird, and a bit pointless.

12. Deep Research: Advanced AI Analysis

Deep Research is a newer option in Gemini designed to synthesize large amounts of information from the web and generate research reports. Users can edit the research plan before the report is created, refine reports, discuss findings, and ask follow-up questions. Deep Research is now available to all Gemini users, making it a useful tool for professionals and students alike.

You can use Deep Research by selecting it from the Gemini model selector or by tapping the Deep Research button on the web. Follow the step-by-step guide below.

  1. Launch Gemini.
  2. Tap the AI model selector at the top.
  3. Select Deep Research.
  4. Start your prompt with “create a research report” or “make a research.”
  5. Once a plan is generated, you can adjust it or continue to create the report.

Image: Choose the Deep Research model when starting Gemini.
Image: Enter a prompt asking Gemini to create a research report. Check or edit the plan before starting the research.
Image: You can export the research report afterward.
Image: Sources are included at the end of the research report.

If on the web, simply select Deep Research in the prompt window.

13. Create AI Podcasts from Documents

Gemini can convert documents and PDFs into podcast-style discussions, featuring AI-generated hosts via Audio Overview.

How to Generate an Audio Overview using PDFs and files

  1. Create a Deep Research report.
  2. Open the Deep Research file and tap the three-dot menu.
  3. Scroll to the bottom and select Generate Audio Overview.
  4. Wait for the podcast to be generated.

14. Control Third-Party Apps with Gemini

Gemini integrates with third-party apps like Spotify and WhatsApp, allowing users to play music, send messages, and control playback with voice commands. Simply speak or type the command to Gemini and specify the app and the task. At present, Gemini works with select third-party apps, but more are likely to be added in the future.

15. Smarter Search in Google Maps

Gemini also enables smarter search capabilities within Google Maps. Beyond just looking for places or planning your routes, you can now perform contextual searches such as “top things to do at night in this city” or “top-rated Thai restaurants.” The search results will show curated suggestions of places or activities and summarized reviews. You can further ask follow-up questions about a place using the “Ask Maps about this place” feature.

This is a free Gemini feature, although it is still rolling out to various countries and regions.

Image: With Gemini in Google Maps, you can run contextual searches for places or activities.
Image: You get curated results and summarized reviews, plus the option to ask Maps about the place.
Image: Open Maps, select a place or location, and then invoke Gemini. Tap the Ask about place button.
Image: Type or speak your question.
Image: Results appear as text or as actions, such as opening directions in Maps.

16. Customize Gemini’s Voice

Unlike Google Assistant, Gemini lets you change the voice of the assistant. Users can choose male or female voices with different tones and accents via Settings > Gemini’s voice.

Image: Go to Gemini’s settings to access the voice change feature.
Image: Tap Gemini’s voice.
Image: Select a Gemini voice by swiping, and then tap back to save your changes.

Gemini Versions Compared: Free, Premium, Gemini Live

Gemini is available in multiple versions, each offering different features based on subscription plans.

  • Basic (Free) Version – This version is available on all compatible Android devices and offers core AI functions.
  • Premium Version (AI Premium) – Part of the Google One AI subscription, this version provides access to more powerful AI models and advanced capabilities, including complex task management, deep research, coding assistance, logical reasoning, and seamless integration with Google apps through extensions.
  • Gemini Live – A conversational chatbot built into the Gemini app available to the basic and premium versions. The premium tier of Gemini Live includes enhanced features like background conversations, making it a more intelligent and responsive assistant.

The Gemini Premium (AI Premium) plan costs $19.99/month. This subscription includes access to Gemini Ultra 1.0, 2 TB of Google One storage, and AI-powered tools in Gmail, Docs, and Sheets. Google also offers a one-month free trial for new subscribers.

Have you tried Gemini or Gemini Live on your device? Which AI feature do you find most useful? Please let us know in the comments!


This Gemini guide was updated in June 2025 with new tips for using Google’s AI more effectively in more apps.