Sui Jingfeng f39db26c54 drm: Add kms driver for loongson display controller
Loongson display controller IP has been integrated in both Loongson north
bridge chipset (ls7a1000/ls7a2000) and Loongson SoCs (ls2k1000/ls2k2000).
It has even been included in Loongson's BMC products. It has two display
pipes, and each display pipe supports a primary plane and a cursor plane.

For the DC in the LS7a1000, each display pipe has a DVO output interface,
which is able to support 1920x1080@60Hz. For the DC in the LS7A2000, each
display pipe is equipped with a built-in HDMI encoder, which is compliant
with the HDMI 1.4 specification. The first display pipe is also equipped
with a transparent VGA encoder, which is parallel with the HDMI encoder.
To get a decent performance for writing framebuffer data to the VRAM, the
write combine support should be enabled.

v1 -> v2:
 1) Use hpd status reg when polling for ls7a2000.
 2) Fix all warnings that emerged when compiling with W=1.

v2 -> v3:
 1) Add COMPILE_TEST to Kconfig and make the driver off by default
 2) Alphabetical sorting headers (Thomas)
 3) Untangle register access functions as much as possible (Thomas)
 4) Switch to TTM-based memory manager (Thomas)
 5) Add the chip ID detection function which can be used to distinguish
    chip models
 6) Revise the built-in HDMI phy driver, nearly all main stream mode below
    4K@30Hz is tested, and this driver supports clone(mirror) display mode
    and extend(joint) display mode.

v3 -> v4:
 1) Quickly fix a small mistake.

v4 -> v5:
 1) Add per display pipe debugfs support to the builtin HDMI encoder.

v5 -> v6:
 1) Remove stray code which didn't get used, say lsdc_of_get_reserved_ram
 2) Fix all typos I could found, make sentences and code more readable
 3) Untangle lsdc_hdmi*_connector_detect() function according to the pipe
 4) Rename this driver as loongson.

v6 -> v7:
1) Add prime support for buffer self-sharing, sharing buffer with
   drm/etnaviv is also tested and it works with limitations.
2) Implement buffer object tracking with list_head.
3) Add S3(sleep to RAM) support
4) Rewrite lsdc_bo_move since TTM core stop allocating resources
    during BO creation. Patch V1 ~ V6 of this series no longer work.
    Thus, we send V7.

v7 -> v8:
 1) Zero a compile warning on a 32-bit platform, compile with W=1
 2) Revise lsdc_bo_gpu_offset() and make minor cleanups.
 3) Pageflip tested on the virtual terminal with the following commands:

    modetest -M loongson -s 32:1920x1080 -v
    modetest -M loongson -s 34:1920x1080 -v -F tiles

   It works like a charm, when running the pageflip test with dual screens
   configuration, another two additional BOs were created by the modetest,
   VRAM usage up to 40+ MB, well we have at least 64MB, still enough.

   # cat bos

       bo[0000]: size:     8112kB VRAM
       bo[0001]: size:       16kB VRAM
       bo[0002]: size:       16kB VRAM
       bo[0003]: size:    16208kB VRAM
       bo[0004]: size:     8112kB VRAM
       bo[0005]: size:     8112kB VRAM

v8 -> v9:
 1) Select I2C and I2C_ALGOBIT in Kconfig, should depend on MMU.
 2) Using pci_get_domain_bus_and_slot to get the GPU device.

v9 -> v10:
 1) Revise lsdc_drm_freeze() to implement S3 correctly. We realized that
    the pinned BO could not be moved, the VRAM lost power when sleeping
    to RAM. Thus, the data in the buffer who is pinned in VRAM will get
    lost when resumed. Yet it's not a big problem because this driver
    relies on the CPU to update the front framebuffer. We can see the
    garbage data when resume from S3, but the screen will show the right
    image as I move the cursor. This is due to the CPU repaint. v10 of
    this patch makes S3 perfect by unpin all of the BOs in VRAM, evict
    them all to system RAM in lsdc_drm_freeze().

v10 -> v11:
 1) On a double-screen case, The buffer object backing the single giant
    framebuffer is referenced by two GEM objects; hence, it will be
    pinned at least twice by prepare_fb() function. This causes its pin
    count > 1. V10 of this patch only unpins VRAM BOs once when suspend,
    which is not correct on double-screen case. V11 of this patch unpin
    the BOs until its pin count reaches zero when suspend. Then, we make
    the S3 support complete finally. With v11, I can't see any garbage
    data when resume.

 2) Fix vblank wait timeout when disable CRTC.
 3) Test against IGT, at least fbdev test and kms_flip test passed.
 4) Rewrite pixel PLL update function, magic numbers eliminated (Emil)
 5) Drop a few common hardware features description in lsdc_desc (Emil)
 6) Drop lsdc_mode_config_mode_valid(), instead add restrictions in dumb
    create function. (Emil)
 7) Untangle the ls7a1000 case and ls7a2000 case completely (Thomas)

v11 -> v12:
 none

v12 -> v13:
 1) Add benchmarks to figure out the bandwidth of the hardware platform.
    Usage:
    # cd /sys/kernel/debug/dri/0/
    # cat benchmark

 2) VRAM is filled with garbage data if uninitialized, add a buffer
    clearing procedure (lsdc_bo_clear), clear the BO on creation time.
 3) Update copyrights and adjust coding style (Huacai)

v13 -> v14:
 1) Trying to add async update support for cursor plane.

v14 -> v15:
 1) Add lsdc_vga_set_decode() funciton, which allow us remove multi-video
    cards workaround, now it allow drm/loongson, drm/amdgpu, drm/etnaviv
    co-exist in the system, more is also possible (Emil and Xuerui)
 2) Fix typos and grammar mistakes as much as possible (Xuerui)
 3) Unify copyrights as GPL-2.0+ (Xuerui)
 4) Fix a bug introduce since V13, TTM may import BO from other drivers,
    we shouldn't clear it on such a case.

Cc: Maarten Lankhorst <maarten.lankhorst@linux.intel.com>
Cc: Maxime Ripard <mripard@kernel.org>
Cc: Thomas Zimmermann <tzimmermann@suse.de>
Cc: David Airlie <airlied@gmail.com>
Cc: Daniel Vetter <daniel@ffwll.ch>
Cc: Sumit Semwal <sumit.semwal@linaro.org>
Cc: "Christian König" <christian.koenig@amd.com>
Cc: Nathan Chancellor <nathan@kernel.org>
Cc: Emil Velikov <emil.l.velikov@gmail.com>
Cc: Geert Uytterhoeven <geert+renesas@glider.be>
Cc: loongson-kernel@lists.loongnix.cn
Tested-by: Liu Peibao <liupeibao@loongson.cn>
Tested-by: Li Yi  <liyi@loongson.cn>
Acked-by: Thomas Zimmermann <tzimmermann@suse.de>
Signed-off-by: Sui Jingfeng <suijingfeng@loongson.cn>
Link: https://patchwork.freedesktop.org/patch/msgid/20230615143613.1236245-2-15330273260@189.cn
2023-07-05 03:51:02 +08:00

389 lines
9.2 KiB
C

/* SPDX-License-Identifier: GPL-2.0+ */
/*
* Copyright (C) 2023 Loongson Technology Corporation Limited
*/
#ifndef __LSDC_DRV_H__
#define __LSDC_DRV_H__
#include <linux/pci.h>
#include <drm/drm_connector.h>
#include <drm/drm_crtc.h>
#include <drm/drm_device.h>
#include <drm/drm_encoder.h>
#include <drm/drm_file.h>
#include <drm/drm_plane.h>
#include <drm/ttm/ttm_device.h>
#include "lsdc_i2c.h"
#include "lsdc_irq.h"
#include "lsdc_gfxpll.h"
#include "lsdc_output.h"
#include "lsdc_pixpll.h"
#include "lsdc_regs.h"
/* Currently, all Loongson display controllers have two display pipes. */
#define LSDC_NUM_CRTC 2
/*
* LS7A1000/LS7A2000 chipsets function as the south & north bridges of the
* Loongson 3 series processors, they are equipped with on-board video RAM
* typically. While Loongson LS2K series are low cost SoCs which share the
* system RAM as video RAM, they don't has a dedicated VRAM.
*
* There is only a 1:1 mapping of crtcs, encoders and connectors for the DC
*
* display pipe 0 = crtc0 + dvo0 + encoder0 + connector0 + cursor0 + primary0
* display pipe 1 = crtc1 + dvo1 + encoder1 + connectro1 + cursor1 + primary1
*/
enum loongson_chip_id {
CHIP_LS7A1000 = 0,
CHIP_LS7A2000 = 1,
CHIP_LS_LAST,
};
const struct lsdc_desc *
lsdc_device_probe(struct pci_dev *pdev, enum loongson_chip_id chip);
struct lsdc_kms_funcs;
/* DC specific */
struct lsdc_desc {
u32 num_of_crtc;
u32 max_pixel_clk;
u32 max_width;
u32 max_height;
u32 num_of_hw_cursor;
u32 hw_cursor_w;
u32 hw_cursor_h;
u32 pitch_align; /* CRTC DMA alignment constraint */
bool has_vblank_counter; /* 32 bit hw vsync counter */
/* device dependent ops, dc side */
const struct lsdc_kms_funcs *funcs;
};
/* GFX related resources wrangler */
struct loongson_gfx_desc {
struct lsdc_desc dc;
u32 conf_reg_base;
/* GFXPLL shared by the DC, GMC and GPU */
struct {
u32 reg_offset;
u32 reg_size;
} gfxpll;
/* Pixel PLL, per display pipe */
struct {
u32 reg_offset;
u32 reg_size;
} pixpll[LSDC_NUM_CRTC];
enum loongson_chip_id chip_id;
char model[64];
};
static inline const struct loongson_gfx_desc *
to_loongson_gfx(const struct lsdc_desc *dcp)
{
return container_of_const(dcp, struct loongson_gfx_desc, dc);
};
struct lsdc_reg32 {
char *name;
u32 offset;
};
/* crtc hardware related ops */
struct lsdc_crtc;
struct lsdc_crtc_hw_ops {
void (*enable)(struct lsdc_crtc *lcrtc);
void (*disable)(struct lsdc_crtc *lcrtc);
void (*enable_vblank)(struct lsdc_crtc *lcrtc);
void (*disable_vblank)(struct lsdc_crtc *lcrtc);
void (*flip)(struct lsdc_crtc *lcrtc);
void (*clone)(struct lsdc_crtc *lcrtc);
void (*get_scan_pos)(struct lsdc_crtc *lcrtc, int *hpos, int *vpos);
void (*set_mode)(struct lsdc_crtc *lcrtc, const struct drm_display_mode *mode);
void (*soft_reset)(struct lsdc_crtc *lcrtc);
void (*reset)(struct lsdc_crtc *lcrtc);
u32 (*get_vblank_counter)(struct lsdc_crtc *lcrtc);
void (*set_dma_step)(struct lsdc_crtc *lcrtc, enum lsdc_dma_steps step);
};
struct lsdc_crtc {
struct drm_crtc base;
struct lsdc_pixpll pixpll;
struct lsdc_device *ldev;
const struct lsdc_crtc_hw_ops *hw_ops;
const struct lsdc_reg32 *preg;
unsigned int nreg;
struct drm_info_list *p_info_list;
unsigned int n_info_list;
bool has_vblank;
};
/* primary plane hardware related ops */
struct lsdc_primary;
struct lsdc_primary_plane_ops {
void (*update_fb_addr)(struct lsdc_primary *plane, u64 addr);
void (*update_fb_stride)(struct lsdc_primary *plane, u32 stride);
void (*update_fb_format)(struct lsdc_primary *plane,
const struct drm_format_info *format);
};
struct lsdc_primary {
struct drm_plane base;
const struct lsdc_primary_plane_ops *ops;
struct lsdc_device *ldev;
};
/* cursor plane hardware related ops */
struct lsdc_cursor;
struct lsdc_cursor_plane_ops {
void (*update_bo_addr)(struct lsdc_cursor *plane, u64 addr);
void (*update_cfg)(struct lsdc_cursor *plane,
enum lsdc_cursor_size cursor_size,
enum lsdc_cursor_format);
void (*update_position)(struct lsdc_cursor *plane, int x, int y);
};
struct lsdc_cursor {
struct drm_plane base;
const struct lsdc_cursor_plane_ops *ops;
struct lsdc_device *ldev;
};
struct lsdc_output {
struct drm_encoder encoder;
struct drm_connector connector;
};
static inline struct lsdc_output *
connector_to_lsdc_output(struct drm_connector *connector)
{
return container_of(connector, struct lsdc_output, connector);
}
static inline struct lsdc_output *
encoder_to_lsdc_output(struct drm_encoder *encoder)
{
return container_of(encoder, struct lsdc_output, encoder);
}
struct lsdc_display_pipe {
struct lsdc_crtc crtc;
struct lsdc_primary primary;
struct lsdc_cursor cursor;
struct lsdc_output output;
struct lsdc_i2c *li2c;
unsigned int index;
};
static inline struct lsdc_display_pipe *
output_to_display_pipe(struct lsdc_output *output)
{
return container_of(output, struct lsdc_display_pipe, output);
}
struct lsdc_kms_funcs {
irqreturn_t (*irq_handler)(int irq, void *arg);
int (*create_i2c)(struct drm_device *ddev,
struct lsdc_display_pipe *dispipe,
unsigned int index);
int (*output_init)(struct drm_device *ddev,
struct lsdc_display_pipe *dispipe,
struct i2c_adapter *ddc,
unsigned int index);
int (*cursor_plane_init)(struct drm_device *ddev,
struct drm_plane *plane,
unsigned int index);
int (*primary_plane_init)(struct drm_device *ddev,
struct drm_plane *plane,
unsigned int index);
int (*crtc_init)(struct drm_device *ddev,
struct drm_crtc *crtc,
struct drm_plane *primary,
struct drm_plane *cursor,
unsigned int index,
bool has_vblank);
};
static inline struct lsdc_crtc *
to_lsdc_crtc(struct drm_crtc *crtc)
{
return container_of(crtc, struct lsdc_crtc, base);
}
static inline struct lsdc_display_pipe *
crtc_to_display_pipe(struct drm_crtc *crtc)
{
return container_of(crtc, struct lsdc_display_pipe, crtc.base);
}
static inline struct lsdc_primary *
to_lsdc_primary(struct drm_plane *plane)
{
return container_of(plane, struct lsdc_primary, base);
}
static inline struct lsdc_cursor *
to_lsdc_cursor(struct drm_plane *plane)
{
return container_of(plane, struct lsdc_cursor, base);
}
struct lsdc_crtc_state {
struct drm_crtc_state base;
struct lsdc_pixpll_parms pparms;
};
struct lsdc_gem {
/* @mutex: protect objects list */
struct mutex mutex;
struct list_head objects;
};
struct lsdc_device {
struct drm_device base;
struct ttm_device bdev;
/* @descp: features description of the DC variant */
const struct lsdc_desc *descp;
struct pci_dev *dc;
struct pci_dev *gpu;
struct loongson_gfxpll *gfxpll;
/* @reglock: protects concurrent access */
spinlock_t reglock;
void __iomem *reg_base;
resource_size_t vram_base;
resource_size_t vram_size;
resource_size_t gtt_base;
resource_size_t gtt_size;
struct lsdc_display_pipe dispipe[LSDC_NUM_CRTC];
struct lsdc_gem gem;
u32 irq_status;
/* tracking pinned memory */
size_t vram_pinned_size;
size_t gtt_pinned_size;
/* @num_output: count the number of active display pipe */
unsigned int num_output;
};
static inline struct lsdc_device *tdev_to_ldev(struct ttm_device *bdev)
{
return container_of(bdev, struct lsdc_device, bdev);
}
static inline struct lsdc_device *to_lsdc(struct drm_device *ddev)
{
return container_of(ddev, struct lsdc_device, base);
}
static inline struct lsdc_crtc_state *
to_lsdc_crtc_state(struct drm_crtc_state *base)
{
return container_of(base, struct lsdc_crtc_state, base);
}
void lsdc_debugfs_init(struct drm_minor *minor);
int ls7a1000_crtc_init(struct drm_device *ddev,
struct drm_crtc *crtc,
struct drm_plane *primary,
struct drm_plane *cursor,
unsigned int index,
bool no_vblank);
int ls7a2000_crtc_init(struct drm_device *ddev,
struct drm_crtc *crtc,
struct drm_plane *primary,
struct drm_plane *cursor,
unsigned int index,
bool no_vblank);
int lsdc_primary_plane_init(struct drm_device *ddev,
struct drm_plane *plane,
unsigned int index);
int ls7a1000_cursor_plane_init(struct drm_device *ddev,
struct drm_plane *plane,
unsigned int index);
int ls7a2000_cursor_plane_init(struct drm_device *ddev,
struct drm_plane *plane,
unsigned int index);
/* Registers access helpers */
static inline u32 lsdc_rreg32(struct lsdc_device *ldev, u32 offset)
{
return readl(ldev->reg_base + offset);
}
static inline void lsdc_wreg32(struct lsdc_device *ldev, u32 offset, u32 val)
{
writel(val, ldev->reg_base + offset);
}
static inline void lsdc_ureg32_set(struct lsdc_device *ldev,
u32 offset,
u32 mask)
{
void __iomem *addr = ldev->reg_base + offset;
u32 val = readl(addr);
writel(val | mask, addr);
}
static inline void lsdc_ureg32_clr(struct lsdc_device *ldev,
u32 offset,
u32 mask)
{
void __iomem *addr = ldev->reg_base + offset;
u32 val = readl(addr);
writel(val & ~mask, addr);
}
static inline u32 lsdc_pipe_rreg32(struct lsdc_device *ldev,
u32 offset, u32 pipe)
{
return readl(ldev->reg_base + offset + pipe * CRTC_PIPE_OFFSET);
}
static inline void lsdc_pipe_wreg32(struct lsdc_device *ldev,
u32 offset, u32 pipe, u32 val)
{
writel(val, ldev->reg_base + offset + pipe * CRTC_PIPE_OFFSET);
}
#endif