What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ?????????? | 00111111001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f3f |
| SJIS-WIN | ?k???????馭 | 001111111000001010001011001111110011111100111111001111110011111100111111001111111110100101100110 | 3f828b3f3f3f3f3f3f3fe966 |
| EUC-JP | 渶k???????馭 | 1000111111000111111011011010001111101011001111110011111100111111001111110011111100111111001111111111000111000111 | 8fc7eda3eb3f3f3f3f3f3f3ff1c7 |
| UTF-8 | 渶k굞杻볩㎘類㎮봺馭 | 111001101011100010110110111011111011110110001011111010101011010110011110111011111010011110001000111010111011001110101001111000111000111010011000111011111010011110010000111000111000111010101110111010111011010010111010111010011010011010101101 | e6b8b6efbd8beab59eefa788ebb3a9e38e98efa790e38eaeebb4bae9a6ad |
| UHC | 渶k굞杻볩㎘類㎮봺馭 | 1110011110110111101000111110101110000010100001101110101011110100100100111110111110100111101001011110101110111010101001111110001010010100100000011110010111011111 | e7b7a3eb8286eaf493efa7a5ebbaa7e29481e5df |

SJIS-Win, EUC-JP: Classic character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
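
A conversion like the one in the table can be approximated with Python's built-in codecs. This is only a minimal sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as are the helper names `encode_row` and `convert`. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` runs seen in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" (byte 0x3f) instead of raising an error.
    data = text.encode(codec, errors="replace")
    # Decode again to see what survives the round trip in this charset.
    shown = data.decode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


def convert(text):
    """Encode the text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (shown, bits, hexa) in convert("馭").items():
        print(name, shown, bits, hexa)
```

Results may differ from the table in places, because codec tables vary slightly between implementations.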