What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 瀨璽瀨ノハ瀨璽瀨ノハB 1111101101010000100011101010001111111011010100001100100111001010111110110101000010001110101000111111101101010000110010011100101001000010 fb508ea3fb50c9cafb508ea3fb50c9ca42
EUC-JP ?璽?ノハ?璽?ノハB 0011111110111100101001010011111110001110110010011000111011001010001111111011110010100101001111111000111011001001100011101100101001000010 3fbca53f8ec98eca3fbca53f8ec98eca42
UTF-8 瀨璽瀨ノハ瀨璽瀨ノハB 11100111100000001010100011100111100100101011110111100111100000001010100011101111101111101000100111101111101111101000101011100111100000001010100011100111100100101011110111100111100000001010100011101111101111101000100111101111101111101000101001000010 e780a8e792bde780a8efbe89efbe8ae780a8e792bde780a8efbe89efbe8a42
UHC 瀨璽瀨??瀨璽瀨??B 1101011011101110110111111101111011010110111011100011111100111111110101101110111011011111110111101101011011101110001111110011111101000010 d6eedfded6ee3f3fd6eedfded6ee3f3f42

SJIS-win, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-win = CP932) and on UNIX (EUC-JP).
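
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-win to `cp932` and UHC to `cp949`) is an assumption. Characters a charset cannot represent are replaced with `?` (0x3f), as in the table, but Python's codecs may cover a slightly different set of characters than this tool does, so individual rows can differ.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def encode_table(text):
    """Return one (label, binary, hex) tuple per charset in CHARSETS."""
    return [(label, *encode_row(text, codec)) for label, codec in CHARSETS.items()]


if __name__ == "__main__":
    for label, binary, hexa in encode_table("璽B"):
        print(label, binary, hexa)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.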