To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 瓮ュ?業??雍??鸚 111000010100010010000011100001010011111110001011110001100011111100111111111010001011010000111111001111111110101001011111 e14483853f8bc63f3fe8b43f3fea5f
EUC-JP 瓮ュ?業??雍??鸚 111000011010010110100101111001010011111110110110110010000011111100111111111100001011011000111111001111111111001111000000 e1a5a5e53fb6c83f3ff0b63f3ff3c0
UTF-8 瓮ュ츍業뤺톷雍멩렐鸚 111001111001001110101110111000111000001110100101111011001011100010001101111001101010010110101101111010111010010010111010111011011000011010110111111010011001101110001101111010111010100110101001111010111010000010010000111010011011100010011010 e793aee383a5ecb88de6a5adeba4baed86b7e99b8deba9a9eba090e9b89a
UHC 瓮ュ츍業뤺톷雍멩렐鸚 1110100010110111101010111110010110101110100010001110010111110110100011111110100010110111100010111110100010111100101110001110011010110111101111001110010110100100 e8b7abe5ae88e5f68fe8b78be8bcb8e6b7bce5a4

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
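
A conversion like the one in the table can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the function name `to_bit_strings` is hypothetical, and the mapping of the table's charset labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which appears to be what the ISO-8859-1 row shows.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexa in to_bit_strings("あ"):
        print(label, binary, hexa)
```

For the input "あ", this sketch yields `e38182` for UTF-8, `82a0` for SJIS-WIN, `a4a2` for EUC-JP, and `3f` for ISO-8859-1, since Latin-1 has no hiragana.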