What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????^ 001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f5e
SJIS-WIN 際?掌繃際?掌棚^ 100011011101101100111111100011111011011011100011011111011000110111011011001111111000111110110110100100100100100101011110 8ddb3f8fb6e37d8ddb3f8fb692495e
EUC-JP 際?掌繃際?掌棚^ 101110101101110100111111101111101011100011100101110111101011101011011101001111111011111010111000110000111010101001011110 badd3fbeb8e5debadd3fbeb8c3aa5e
UTF-8 際렚掌繃際렚掌棚^ 11101001100110101001101111101011101000001001101011100110100011101000110011100111101110011000001111101001100110101001101111101011101000001001101011100110100011101000110011100110101000111001101001011110 e99a9beba09ae68e8ce7b983e99a9beba09ae68e8ce6a39a5e
UHC 際렚掌繃際렚掌棚^ 1111000010110111100011101010110111101101111001101101110111011110111100001011011110001110101011011110110111100110110111011101110001011110 f0b78eadede6dddef0b78eadede6dddc5e

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table.
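
A conversion like the one in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be the closest Python equivalents of the charsets listed, and the helper name `encode_all` is hypothetical. Unencodable characters are replaced with "?", which appears to match the 3f bytes shown above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_all("際^").items():
        print(label, bits, hex_string)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.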