What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 塋ゅ?耶?オ餓 | 100110101100100010000010111000110011111110010110111010110011111110000011010010011000100111101100 | 9ac882e33f96eb3f834989ec
EUC-JP | 塋ゅ?耶?オ餓 | 110101001100101010100100111001010011111111001100111011010011111110100101101010101011001011101110 | d4caa4e53fcced3fa5aab2ee
UTF-8 | 塋ゅ렘耶섊オ餓 | 111001011010000110001011111000111000001010000101111010111010000010011000111010001000000010110110111011001000010010001010111000111000001010101010111010011010010010010011 | e5a18be38285eba098e880b6ec848ae382aae9a493
UHC | 塋ゅ렘耶섊オ餓 | 1110011110101011101010101110010110110111101111011110010110101101100110001110011110101011101010101110010010111011 | e7abaae5b7bde5ad98e7abaae4bb

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
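
The conversion shown in the table can be approximated with a short Python sketch. This is not the converter's actual code. The function names (`to_bit_strings`, `convert`) and the mapping from the table's charset labels to Python codec names are assumptions made for illustration; in particular, Python's `cp932` and `cp949` codecs are assumed to stand in for SJIS-WIN and UHC, and they may differ from the tool's own tables for some characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table does as well.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 as the Python equivalent
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 as the Python equivalent
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hex) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("オ").items():
        print(label, binary, hexadecimal)
```

For the single character オ, for example, the UTF-8 row comes out as e382aa, the SJIS-WIN row as 8349, and the EUC-JP row as a5aa, the same byte sequences that appear inside the corresponding rows of the table.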