To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 逕ッ雖牙ッ倡明蟇倡明逕ッ雖牙ッ倡明蟇倡明B 11100111100101001010111111100101101010111000100111100101101011111001100011100111100101101011111011100101101011111001100011100111100101101011111011100111100101001010111111100101101010111000100111100101101011111001100011100111100101101011111011100101101011111001100011100111100101101011111001000010 e794afe5ab89e5af98e796bee5af98e796bee794afe5ab89e5af98e796bee5af98e796be42
EUC-JP 逕ッ雖牙ッ倡明蟇倡明逕ッ雖牙ッ倡明蟇倡明B 1110110111110100100011101010111111101010101011011011001011100111100011101010111111010000111010011100110011000000111010101011000111010000111010011100110011000000111011011111010010001110101011111110101010101101101100101110011110001110101011111101000011101001110011001100000011101010101100011101000011101001110011001100000001000010 edf48eafeaadb2e78eafd0e9ccc0eab1d0e9ccc0edf48eafeaadb2e78eafd0e9ccc0eab1d0e9ccc042
UTF-8 逕ッ雖牙ッ倡明蟇倡明逕ッ雖牙ッ倡明蟇倡明B 11101001100000001001010111101111101111011010111111101001100110111001011011100111100010011001100111101111101111011010111111100101100000001010000111100110100110001000111011101000100111111000011111100101100000001010000111100110100110001000111011101001100000001001010111101111101111011010111111101001100110111001011011100111100010011001100111101111101111011010111111100101100000001010000111100110100110001000111011101000100111111000011111100101100000001010000111100110100110001000111001000010 e98095efbdafe99b96e78999efbdafe580a1e6988ee89f87e580a1e6988ee98095efbdafe99b96e78999efbdafe580a1e6988ee89f87e580a1e6988e42
UHC 逕?雖牙?倡明?倡明逕?雖牙?倡明?倡明B 1100110011101111001111111110001011001100111001001011001100111111111100111101101111011001101001010011111111110011110110111101100110100101110011001110111100111111111000101100110011100100101100110011111111110011110110111101100110100101001111111111001111011011110110011010010101000010 ccef3fe2cce4b33ff3dbd9a53ff3dbd9a5ccef3fe2cce4b33ff3dbd9a53ff3dbd9a542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)