To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????}????????{^ 00111111001111110011111100111111001111110011111100111111001111110111110100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 爾鬚耳鹿爾鬚?汐}爾鬚耳鹿爾鬚?汐{^ 100011101010001011101001101000101000111010101000100011101010110110001110101000101110100110100010001111111000111010101100011111011000111010100010111010011010001010001110101010001000111010101101100011101010001011101001101000100011111110001110101011000111101101011110 8ea2e9a28ea88ead8ea2e9a23f8eac7d8ea2e9a28ea88ead8ea2e9a23f8eac7b5e
EUC-JP 爾鬚耳鹿爾鬚?汐}爾鬚耳鹿爾鬚?汐{^ 101111001010010011110010101001001011110010101010101111001010111110111100101001001111001010100100001111111011110010101110011111011011110010100100111100101010010010111100101010101011110010101111101111001010010011110010101001000011111110111100101011100111101101011110 bca4f2a4bcaabcafbca4f2a43fbcae7dbca4f2a4bcaabcafbca4f2a43fbcae7b5e
UTF-8 爾鬚耳鹿爾鬚罹汐}爾鬚耳鹿爾鬚罹汐{^ 111001111000100010111110111010011010110010011010111010001000000010110011111010011011100110111111111001111000100010111110111010011010110010011010111011111010011110100110111001101011000110010000011111011110011110001000101111101110100110101100100110101110100010000000101100111110100110111001101111111110011110001000101111101110100110101100100110101110111110100111101001101110011010110001100100000111101101011110 e788bee9ac9ae880b3e9b9bfe788bee9ac9aefa7a6e6b1907de788bee9ac9ae880b3e9b9bfe788bee9ac9aefa7a6e6b1907b5e
UHC 爾鬚耳鹿爾鬚罹汐}爾鬚耳鹿爾鬚罹汐{^ 1110110010110011111000101101000111101100101111001101011011100011111011001011001111100010110100011110110010111010111000001011000101111101111011001011001111100010110100011110110010111100110101101110001111101100101100111110001011010001111011001011101011100000101100010111101101011110 ecb3e2d1ecbcd6e3ecb3e2d1ecbae0b17decb3e2d1ecbcd6e3ecb3e2d1ecbae0b17b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)