To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 霓??倚θ?幽??鈺??壹??誘る?? 111010001011110100111111001111111001100011011111100000111100011000111111100101110100100000111111001111111111101111000100001111110011111110011010111000110011111100111111100101110101010110000010111010010011111100111111 e8bd3f3f98df83c63f97483f3ffbc43f3f9ae33f3f975582e93f3f
EUC-JP 霓??倚θ?幽??鈺??壹??誘る?? 11110000101111110011111100111111110100001110000110100110110010000011111111001101101010010011111100111111100011111110001111010101001111110011111111010100111001010011111100111111110011011011011010100100111010110011111100111111 f0bf3f3fd0e1a6c83fcda93f3f8fe3d53f3fd4e53f3fcdb6a4eb3f3f
UTF-8 霓낅뜃倚θ짆幽껊렗鈺곗슙壹띰쬆誘る퉮廬 1110100110011100100100111110101110000010100001011110101110011100100000111110010110000000100110101100111010111000111011001010011110000110111001011011100110111101111010101011101110001010111010111010000010010111111010011000100010111010111010101011001110010111111011001000101010011001111001011010001110111001111010111001110110110000111011001010110010000110111010001010101010011000111000111000001010001011111011011000100110101110111011111010011010000010 e99c93eb8285eb9c83e5809aceb8eca786e5b9bdeabb8aeba097e988baeab397ec8a99e5a3b9eb9db0ecac86e8aa98e3828bed89aeefa682
UHC 霓낅뜃倚θ짆幽껊렗鈺곗슙壹띰쬆誘る퉮廬 1110011111100111100001011110101110001101100001111110101111101111101001011110100010100011100101011110101011101011100000111110101110001110101011001110100010101101101100001110110010011010101001111110110011101100101101101110111110100110100111011110101110101111101010101110101110111001100001101110010111111110 e7e785eb8d87ebefa5e8a395eaeb83eb8eace8adb0ec9aa7ececb6efa69debafaaebb986e5fe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)