What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????????? | 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN | 瓮ュ?業??雍??鸚? | 11100001010001001000001110000101001111111000101111000110001111110011111111101000101101000011111100111111111010100101111100111111 | e14483853f8bc63f3fe8b43f3fea5f3f
EUC-JP | 瓮ュ?業??雍??鸚? | 11100001101001011010010111100101001111111011011011001000001111110011111111110000101101100011111100111111111100111100000000111111 | e1a5a5e53fb6c83f3ff0b63f3ff3c03f
UTF-8 | 瓮ュ츍業뤺톷雍멱㉦鸚귛 | 111001111001001110101110111000111000001110100101111011001011100010001101111001101010010110101101111010111010010010111010111011011000011010110111111010011001101110001101111010111010100110110001111000111000100110100110111010011011100010011010111010101011011110011011 | e793aee383a5ecb88de6a5adeba4baed86b7e99b8deba9b1e389a6e9b89aeab79b
UHC | 瓮ュ츍業뤺톷雍멱㉦鸚귛 | 11101000101101111010101111100101101011101000100011100101111101101000111111101000101101111000101111101000101111001011100011101000101010001011011111100101101001001000001011100101 | e8b7abe5ae88e5f68fe8b78be8bcb8e8a8b7e5a482e5

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
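
The kind of conversion shown in the table can be approximated in Python. This is a minimal sketch, not this page's actual implementation: the helper name `to_bit_string` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is how the all-`3f` ISO-8859-1 row arises.

```python
from __future__ import annotations


def to_bit_string(text: str, charset: str) -> tuple[str, str]:
    """Encode text in charset and return (binary string, hex string).

    Characters the charset cannot represent are replaced with '?' (0x3f).
    """
    data = text.encode(charset, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        bits, hex_string = to_bit_string("日", codec)
        print(label, bits, hex_string)
```

For a single Japanese character such as `日`, the same character yields different byte sequences under SJIS-WIN and EUC-JP, and falls back to `3f` under ISO-8859-1, which cannot represent it.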