To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 壯э?澳??橈??}壯э?澳??橈??{^ 1001101011100001100001001000111100111111111000000101001100111111001111111001111011110100001111110011111101111101100110101110000110000100100011110011111111100000010100110011111100111111100111101111010000111111001111110111101101011110 9ae1848f3fe0533f3f9ef43f3f7d9ae1848f3fe0533f3f9ef43f3f7b5e
EUC-JP 壯э?澳??橈??}壯э?澳??橈??{^ 1101010011100011101001111110111100111111110111111011010000111111001111111101110011110110001111110011111101111101110101001110001110100111111011110011111111011111101101000011111100111111110111001111011000111111001111110111101101011110 d4e3a7ef3fdfb43f3fdcf63f3f7dd4e3a7ef3fdfb43f3fdcf63f3f7b5e
UTF-8 壯э쉼澳뽲떁橈삼풆}壯э쉼澳뽲떁橈삼풆{^ 11100101101000111010111111010001100011011110110010001001101111001110011010111110101100111110101110111101101100101110101110010110100000011110011010101001100010001110110010000010101111001110110110010010100001100111110111100101101000111010111111010001100011011110110010001001101111001110011010111110101100111110101110111101101100101110101110010110100000011110011010101001100010001110110010000010101111001110110110010010100001100111101101011110 e5a3afd18dec89bce6beb3ebbdb2eb9681e6a988ec82bced92867de5a3afd18dec89bce6beb3ebbdb2eb9681e6a988ec82bced92867b5e
UHC 壯э쉼澳뽲떁橈삼풆}壯э쉼澳뽲떁橈삼풆{^ 111011011110000010101100111011111011110110110000111001111111111010010110111011101000101110010111111010001111101010111011111011111011111010001110011111011110110111100000101011001110111110111101101100001110011111111110100101101110111010001011100101111110100011111010101110111110111110111110100011100111101101011110 ede0acefbdb0e7fe96ee8b97e8fabbefbe8e7dede0acefbdb0e7fe96ee8b97e8fabbefbe8e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)