To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣o?蹂?? 001111110011111100111111100010111000001110000010100011110011111111100110111110000011111100111111 3f3f3f8b83828f3fe6f83f3f
EUC-JP ???泣o?蹂?? 001111110011111100111111101101011110001110100011111011110011111111101100111110100011111100111111 3f3f3fb5e3a3ef3fecfa3f3f
UTF-8 劣꾨툖泣o쭏蹂⒲럹 111011111010011010011101111010101011111010101000111011011000100010010110111001101011001110100011111011111011110110001111111011001010110110001111111010001011100110000010111000101001001010110010111010111001111110111001 efa69deabea8ed8896e6b3a3efbd8fecad8fe8b982e292b2eb9fb9
UHC 劣꾨툖泣o쭏蹂⒲럹 111001101110101110000100111010111011100010001101111010111110100010100011111011111010011110001000111010111011001110101001111000111000111010011000 e6eb84ebb88debe8a3efa788ebb3a9e38e98

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)