What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 臟?圓??肯缺?肯缺臟?圓??肯缺?肯缺B 111001000110011000111111100110101010001000111111001111111000110101101101111000111001111000111111100011010110110111100011100111101110010001100110001111111001101010100010001111110011111110001101011011011110001110011110001111111000110101101101111000111001111001000010 e4663f9aa23f3f8d6de39e3f8d6de39ee4663f9aa23f3f8d6de39e3f8d6de39e42
EUC-JP 臟?圓??肯缺嫄肯缺臟?圓??肯缺嫄肯缺B 11100111110001110011111111010100101001000011111100111111101110011100111011100101111111101000111110111010101000011011100111001110111001011111111011100111110001110011111111010100101001000011111100111111101110011100111011100101111111101000111110111010101000011011100111001110111001011111111001000010 e7c73fd4a43f3fb9cee5fe8fbaa1b9cee5fee7c73fd4a43f3fb9cee5fe8fbaa1b9cee5fe42
UTF-8 臟렞圓꿰렞肯缺嫄肯缺臟렞圓꿰렞肯缺嫄肯缺B 11101000100001111001111111101011101000001001111011100101100111001001001111101010101111111011000011101011101000001001111011101000100000101010111111100111101111001011101011100101101010111000010011101000100000101010111111100111101111001011101011101000100001111001111111101011101000001001111011100101100111001001001111101010101111111011000011101011101000001001111011101000100000101010111111100111101111001011101011100101101010111000010011101000100000101010111111100111101111001011101001000010 e8879feba09ee59c93eabfb0eba09ee882afe7bcbae5ab84e882afe7bcbae8879feba09ee59c93eabfb0eba09ee882afe7bcbae5ab84e882afe7bcba42
UHC 臟렞圓꿰렞肯缺嫄肯缺臟렞圓꿰렞肯缺嫄肯缺B 1110110111110100100011101010111111101010101011011011001011100111100011101010111111010000111010011100110011000000111010101011000111010000111010011100110011000000111011011111010010001110101011111110101010101101101100101110011110001110101011111101000011101001110011001100000011101010101100011101000011101001110011001100000001000010 edf48eafeaadb2e78eafd0e9ccc0eab1d0e9ccc0edf48eafeaadb2e78eafd0e9ccc0eab1d0e9ccc042

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
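
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the `CHARSETS` mapping and the `bit_strings` helper are hypothetical names, and it assumes that Python's `cp932` and `cp949` codecs correspond to SJIS-WIN and UHC. Characters a charset cannot represent are replaced with `?` (0x3f), which is consistent with the ISO-8859-1 row above.

```python
# Hypothetical mapping from the labels used on this page to Python codec names.
# Assumption: SJIS-WIN is treated as cp932 and UHC as cp949.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text):
    """Return (charset label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, with no separators.
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in bit_strings("臟B"):
        print(label, binary, hexadecimal)
```

Each row holds the same text encoded under a different charset, so the number of bytes per character varies: the same character takes three bytes in UTF-8 but two in the double-byte Japanese charsets.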