To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????Lh?????????L 001111110011111100111111001111110011111100111111001111110011111100111111010011000110100000111111001111110011111100111111001111110011111100111111001111110011111101001100 3f3f3f3f3f3f3f3f3f4c683f3f3f3f3f3f3f3f3f4c
SJIS-WIN ???}????颱Lh???}????颱L 00111111001111110011111110000001011100000011111100111111001111110011111111101001010001010100110001101000001111110011111100111111100000010111000000111111001111110011111100111111111010010100010101001100 3f3f3f81703f3f3f3fe9454c683f3f3f81703f3f3f3fe9454c
EUC-JP ???}????颱Lh???}????颱L 00111111001111110011111110100001110100010011111100111111001111110011111111110001101001100100110001101000001111110011111100111111101000011101000100111111001111110011111100111111111100011010011001001100 3f3f3fa1d13f3f3f3ff1a64c683f3f3fa1d13f3f3f3ff1a64c
UTF-8 룵₃룵}룵₃룵ㄱ颱Lh룵₃룵}룵₃룵ㄱ颱L 111010111010001110110101111000101000001010000011111010111010001110110101111011111011110110011101111010111010001110110101111000101000001010000011111010111010001110110101111000111000010010110001111010011010001010110001010011000110100011101011101000111011010111100010100000101000001111101011101000111011010111101111101111011001110111101011101000111011010111100010100000101000001111101011101000111011010111100011100001001011000111101001101000101011000101001100 eba3b5e28283eba3b5efbd9deba3b5e28283eba3b5e384b1e9a2b14c68eba3b5e28283eba3b5efbd9deba3b5e28283eba3b5e384b1e9a2b14c
UHC 룵₃룵}룵₃룵ㄱ颱Lh룵₃룵}룵₃룵ㄱ颱L 100011111010101010101001111111011000111110101010101000111111110110001111101010101010100111111101100011111010101010100100101000011111011111000111010011000110100010001111101010101010100111111101100011111010101010100011111111011000111110101010101010011111110110001111101010101010010010100001111101111100011101001100 8faaa9fd8faaa3fd8faaa9fd8faaa4a1f7c74c688faaa9fd8faaa3fd8faaa9fd8faaa4a1f7c74c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)