What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN ?朧??弄?六 00111111100111100100111100111111001111111001100001001101001111111001100001011010 3f9e4f3f3f984d3f985a
EUC-JP 鋌朧?鋌弄?六 1000111111100100101110111101101110110000001111111000111111100100101110111100111110101110001111111100111110111011 8fe4bbdbb03f8fe4bbcfae3fcfbb
UTF-8 鋌朧껴鋌弄렭六 111010011000101110001100111001101001110010100111111010101011101110110100111010011000101110001100111001011011110010000100111010111010000010101101111001011000010110101101 e98b8ce69ca7eabbb4e98b8ce5bc84eba0ade585ad
UHC 鋌朧껴鋌弄렭六 1110111111111011110101101110100010110010101110001110111111111011110101101110011110001110101110101101011110111111 effbd6e8b2b8effbd6e78ebad7bf

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
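
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the runs of 3f in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" turns unencodable characters into b"?" (0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def conversion_table(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexed) in conversion_table("六").items():
        print(label, bits, hexed)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column. Results for individual characters may differ slightly from this page where the codec tables differ.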