Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 燿??瓦℡?? 11100000101000000011111100111111100010101010001010000111100001000011111100111111 e0a03f3f8aa287843f3f
EUC-JP 燿??瓦??? 111000001010001000111111001111111011010010100100001111110011111100111111 e0a23f3fb4a43f3f3f
UTF-8 燿먲쉑瓦℡떵暳 111001111000011110111111111010111010100010110010111011001000100110010001111001111001001110100110111000101000010010100001111010111001011010110101111001101001101010110011 e787bfeba8b2ec8991e793a6e284a1eb96b5e69ab3
UHC 燿먲쉑瓦℡떵暳 1110100011111100100100001110111110111101101001111110100010111111101000101110010110110110101110101111101110110101 e8fc90efbda7e8bfa2e5b6bafbb5

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
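
The same kind of table can be reproduced offline. The following is a minimal Python sketch, not this converter's actual implementation. The codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumed to be the closest Python equivalents of the charsets listed above, and the helper name `encode_all` is hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table, though individual mappings may differ from the converter's.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",   # Windows code page 932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code
}


def encode_all(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" turns unencodable characters into b"?" (0x3f).
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_all("あ").items():
        print(label, bits, hex_string)
```

Calling `encode_all` with any short string returns one entry per charset, holding the binary and hexadecimal forms in the same order as the table columns.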