To what bit string is a character (or a string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 壯↓?樟→????壯↓?樟→????^ 100110101110000110000001101010110011111110001111101111101000000110101000001111110011111100111111001111111001101011100001100000011010101100111111100011111011111010000001101010000011111100111111001111110011111101011110 9ae181ab3f8fbe81a83f3f3f3f9ae181ab3f8fbe81a83f3f3f3f5e
EUC-JP 壯↓?樟→????壯↓?樟→????^ 110101001110001110100010101011010011111110111110110000001010001010101010001111110011111100111111001111111101010011100011101000101010110100111111101111101100000010100010101010100011111100111111001111110011111101011110 d4e3a2ad3fbec0a2aa3f3f3f3fd4e3a2ad3fbec0a2aa3f3f3f3f5e
UTF-8 壯↓뀒樟→굢若띺뀒壯↓뀒樟→굢若띸겮^ 11100101101000111010111111100010100001101001001111101011100000001001001011100110101010001001111111100010100001101001001011101010101101011010001011101111101001011011010011101011100111011011101011101011100000001001001011100101101000111010111111100010100001101001001111101011100000001001001011100110101010001001111111100010100001101001001011101010101101011010001011101111101001011011010011101011100111011011100011101010101100101010111001011110 e5a3afe28693eb8092e6a89fe28692eab5a2efa5b4eb9dbaeb8092e5a3afe28693eb8092e6a89fe28692eab5a2efa5b4eb9db8eab2ae5e
UHC 壯↓뀒樟→굢若띺뀒壯↓뀒樟→굢若띸겮^ 11101101111000001010000111101001100001011000110011101101111010011010000111100110100000101000100111100101101011101000110111101001100001011000110011101101111000001010000111101001100001011000110011101101111010011010000111100110100000101000100111100101101011101000110111100111100000011011110001011110 ede0a1e9858cede9a1e68289e5ae8de9858cede0a1e9858cede9a1e68289e5ae8de781bc5e

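SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent appears as "?" (0x3f) in that charset's row.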
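
The conversion shown in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the code behind this page: the function name to_bit_strings and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) are assumptions made for illustration. Unencodable characters are replaced with "?", which matches the 3f bytes seen in the rows above.

```python
def to_bit_strings(text, charset):
    """Encode text in the given charset and return (binary, hexadecimal) strings."""
    # errors="replace" turns characters the charset cannot represent into "?" (0x3f).
    data = text.encode(charset, errors="replace")
    # Each byte becomes exactly eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("壯↓^", codec)
        print(label, binary, hexadecimal)
```

Calling to_bit_strings("^", "utf-8") returns ("01011110", "5e"), the same final byte that ends every row of the table. Results for other charsets may differ in detail from this page, since codec tables vary slightly between implementations.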