To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 哀??油?、吟? 100010001010001100111111001111111001011011111011001111111000000101000001100010111110000100111111 88a33f3f96fb3f81418be13f
EUC-JP 哀??油?、吟? 101100001010010100111111001111111100110011111101001111111010000110100010101101101110001100111111 b0a53f3fccfd3fa1a2b6e33f
UTF-8 哀노강油욕、吟쨑 111001011001001110000000111010111000010110111000111010101011000010010101111001101011001010111001111011001001101010010101111000111000000010000001111001011001000010011111111011001010100010010001 e59380eb85b8eab095e6b2b9ec9a95e38081e5909feca891
UHC 哀노강油욕、吟쨑 11100100111011101011001111101011101100001010110111101010111110101011111111100101101000011010001011101011111000011010010001101000 e4eeb3ebb0adeafabfe5a1a2ebe1a468

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)