To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 堰??誼??違??筌??誼ユ? 100010011000000100111111001111111000101101100010001111110011111110001000111000010011111100111111111000101010001100111111001111111000101101100010100000111000011000111111 89813f3f8b623f3f88e13f3fe2a33f3f8b6283863f
EUC-JP 堰??誼??違??筌??誼ユ? 101100011110000100111111001111111011010111000011001111110011111110110000111000110011111100111111111001001010010100111111001111111011010111000011101001011110011000111111 b1e13f3fb5c33f3fb0e33f3fe4a53f3fb5c3a5e63f
UTF-8 堰쇰챶誼쎾짃違곸뫊筌뗣꺂誼ユ쉸 111001011010000010110000111011001000011110110000111011001011000110110110111010001010101010111100111011001000111010111110111011001010011110000011111010011000000110010101111010101011001110111000111010111010101110001010111001111010110110001100111010111001011110100011111010101011101010000010111010001010101010111100111000111000001110100110111011001000100110111000 e5a0b0ec87b0ecb1b6e8aabcec8ebeeca783e98195eab3b8ebab8ae7ad8ceb97a3eaba82e8aabce383a6ec89b8
UHC 堰쇰챶誼쎾짃違곸뫊筌뗣꺂誼ユ쉸 111001011110100010111100111010111010101010000011111010111111111010011011111001011010001110010011111010101101111010000001111011001001000110101100111011111010011110001011111000111000001110101011111010111111111010101011111001101001101010001110 e5e8bcebaa83ebfe9be5a393eade81ec91acefa78be383abebfeabe69a8e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)