To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?ャ?乳?????B 001111111000001110000011001111111001001111111011001111110011111100111111001111110011111101000010 3f83833f93fb3f3f3f3f3f42
EUC-JP ?ャ?乳?????B 001111111010010111100011001111111100011011111101001111110011111100111111001111110011111101000010 3fa5e33fc6fd3f3f3f3f3f42
UTF-8 曆ャ끉乳얏략栒쇰꺘B 11101111101001101000101111100011100000111010001111101011100000011000100111100100101110011011001111101100100101101000111111101011100111101011010111100110101000001001001011101100100001111011000011101010101110101001100001000010 efa68be383a3eb8189e4b9b3ec968feb9eb5e6a092ec87b0eaba9842
UHC 曆ャ끉乳얏략栒쇰꺘B 11100110101101111010101111100011100001011011110011101010111000011011111011100110101101111010101111100010111000111011110011101011100000111011111001000010 e6b7abe385bceae1bee6b7abe2e3bceb83be42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)