Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 怏??乙???Ε 1001110010001001001111110011111110001001101100110011111100111111001111111000001110100011 9c893f3f89b33f3f3f83a3
EUC-JP 怏??乙??洹Ε 11010111111010010011111100111111101100101011010100111111001111111000111111000111101110101010011010100101 d7e93f3fb2b53f3f8fc7baa6a5
UTF-8 怏얘랩乙㎩듋洹Ε 1110011010000000100011111110110010010110100110001110101110011110101010011110010010111001100110011110001110001110101010011110101110010011100010111110011010110100101110011100111010010101 e6808fec9698eb9ea9e4b999e38ea9eb938be6b4b9ce95
UHC 怏얘랩乙㎩듋洹Ε 11100100111010001011111011101010101101111010011011101011111000001010011111100101100010101011111011101010101101111010010111000101 e4e8beeab7a6ebe0a7e58abeeab7a5c5

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent are shown as "?" (byte 3f) in the table above.
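
A conversion like the one in the table can be sketched in a few lines of Python. This is only an illustration, not the tool's own implementation, which is not shown here. The function name `encode_row` and the `CHARSETS` mapping are hypothetical, and the Python codec names `cp932`, `euc_jp` and `cp949` are assumed to correspond to SJIS-WIN, EUC-JP and UHC; their mapping tables may differ from the tool's for some characters.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# Assumption: cp932 ~ SJIS-WIN, euc_jp ~ EUC-JP, cp949 ~ UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Characters the charset cannot represent are replaced by "?" (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    # Decode again to show which characters survived the round trip.
    shown = raw.decode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    sample = "乙Ε"
    print("Charset Character Bit string (binary) Bit string (hexadecimal)")
    for label, codec in CHARSETS.items():
        shown, bits, hexa = encode_row(sample, codec)
        print(label, shown, bits, hexa)
```

Using `errors="replace"` mirrors the "?" substitution visible in the ISO-8859-1, SJIS-WIN and EUC-JP rows; other error handlers (for example `"strict"`) would raise an exception instead.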