To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鶯?????苒??泣 11101001111100100011111100111111001111110011111100111111111001001001001000111111001111111000101110000011 e9f23f3f3f3f3fe4923f3f8b83
EUC-JP 鶯?????苒??泣 11110010111101000011111100111111001111110011111100111111111001111111001000111111001111111011010111100011 f2f43f3f3f3f3fe7f23f3fb5e3
UTF-8 鶯뺞찇溜쒖뼏苒믦펹泣 111010011011011010101111111010111011101010011110111011001011000010000111111011111010011110001011111011001001001010010110111010111011110010001111111010001000101110010010111010111010111110100110111011011000111010111001111001101011001110100011 e9b6afebba9eecb087efa78bec9296ebbc8fe88b92ebafa6ed8eb9e6b3a3
UHC 鶯뺞찇溜쒖뼏苒믦펹泣 1110010110100011100101011110011010101001100010111110101011111110100111001110110010010110100101111110011011111110100100101110100010111100100010011110101111101000 e5a395e6a98beafe9cec9697e6fe92e8bc89ebe8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)