To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????w?????????w^ 001111110011111100111111001111110011111100111111001111110011111100111111011101110011111100111111001111110011111100111111001111110011111100111111001111110111011101011110 3f3f3f3f3f3f3f3f3f773f3f3f3f3f3f3f3f3f775e
SJIS-WIN 瘟??役??蘊??w瘟??役??蘊??w^ 111000011000100100111111001111111001011011110000001111110011111111100101010111010011111100111111011101111110000110001001001111110011111110010110111100000011111100111111111001010101110100111111001111110111011101011110 e1893f3f96f03f3fe55d3f3f77e1893f3f96f03f3fe55d3f3f775e
EUC-JP 瘟??役??蘊??w瘟??役??蘊??w^ 111000011110100100111111001111111100110011110010001111110011111111101001101111100011111100111111011101111110000111101001001111110011111111001100111100100011111100111111111010011011111000111111001111110111011101011110 e1e93f3fccf23f3fe9be3f3f77e1e93f3fccf23f3fe9be3f3f775e
UTF-8 瘟룬쮵役숂떥蘊딃쮵w瘟룬쮵役숂떥蘊딃쮵w^ 111001111001100010011111111010111010001110101100111011001010111010110101111001011011110110111001111011001000100010000010111010111001011010100101111010001001100010001010111010111001010010000011111011001010111010110101011101111110011110011000100111111110101110100011101011001110110010101110101101011110010110111101101110011110110010001000100000101110101110010110101001011110100010011000100010101110101110010100100000111110110010101110101101010111011101011110 e7989feba3acecaeb5e5bdb9ec8882eb96a5e8988aeb9483ecaeb577e7989feba3acecaeb5e5bdb9ec8882eb96a5e8988aeb9483ecaeb5775e
UHC 瘟룬쮵役숂떥蘊딃쮵w瘟룬쮵役숂떥蘊딃쮵w^ 111010001011000010110111111010011010100010010010111001101011010110011001111001111000101110111000111010001011001110001010111010011010100010010010011101111110100010110000101101111110100110101000100100101110011010110101100110011110011110001011101110001110100010110011100010101110100110101000100100100111011101011110 e8b0b7e9a892e6b599e78bb8e8b38ae9a89277e8b0b7e9a892e6b599e78bb8e8b38ae9a892775e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)