To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 弱??絶??庄?弱??絶??庄?B 1000111011100011001111110011111110010000111000100011111100111111100011111010111100111111100011101110001100111111001111111001000011100010001111110011111110001111101011110011111101000010 8ee33f3f90e23f3f8faf3f8ee33f3f90e23f3f8faf3f42
EUC-JP 弱??絶??庄?弱??絶??庄?B 1011110011100101001111110011111111000000111001000011111100111111101111101011000100111111101111001110010100111111001111111100000011100100001111110011111110111110101100010011111101000010 bce53f3fc0e43f3fbeb13fbce53f3fc0e43f3fbeb13f42
UTF-8 弱녺쵉絶딂젃庄탖弱녺쵉絶딂젃庄탖B 11100101101111001011000111101011100001011011101011101100101101011000100111100111101101011011011011101011100101001000001011101100101000001000001111100101101110101000010011101101100000111001011011100101101111001011000111101011100001011011101011101100101101011000100111100111101101011011011011101011100101001000001011101100101000001000001111100101101110101000010011101101100000111001011001000010 e5bcb1eb85baecb589e7b5b6eb9482eca083e5ba84ed8396e5bcb1eb85baecb589e7b5b6eb9482eca083e5ba84ed839642
UHC 弱녺쵉絶딂젃庄탖弱녺쵉絶딂젃庄탖B 111001011011000010000110111001111010110010001011111011111011111010001010111010001010000010000111111011011110010010110101011101101110010110110000100001101110011110101100100010111110111110111110100010101110100010100000100001111110110111100100101101010111011001000010 e5b086e7ac8befbe8ae8a087ede4b576e5b086e7ac8befbe8ae8a087ede4b57642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)