To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 欲??澳??預э?[欲??澳??預э?[^ 1001011101111110001111110011111111100000010100110011111100111111100101110110000110000100100011110011111101011011100101110111111000111111001111111110000001010011001111110011111110010111011000011000010010001111001111110101101101011110 977e3f3fe0533f3f9761848f3f5b977e3f3fe0533f3f9761848f3f5b5e
EUC-JP 欲??澳??預э?[欲??澳??預э?[^ 1100110111011111001111110011111111011111101101000011111100111111110011011100001010100111111011110011111101011011110011011101111100111111001111111101111110110100001111110011111111001101110000101010011111101111001111110101101101011110 cddf3f3fdfb43f3fcdc2a7ef3f5bcddf3f3fdfb43f3fcdc2a7ef3f5b5e
UTF-8 欲뀐풐澳뽲떂預э슭[欲뀐풐澳뽲떂預э슭[^ 11100110101011001011001011101011100000001001000011101101100100101001000011100110101111101011001111101011101111011011001011101011100101101000001011101001101000001001000011010001100011011110110010001010101011010101101111100110101011001011001011101011100000001001000011101101100100101001000011100110101111101011001111101011101111011011001011101011100101101000001011101001101000001001000011010001100011011110110010001010101011010101101101011110 e6acb2eb8090ed9290e6beb3ebbdb2eb9682e9a090d18dec8aad5be6acb2eb8090ed9290e6beb3ebbdb2eb9682e9a090d18dec8aad5b5e
UHC 欲뀐풐澳뽲떂預э슭[欲뀐풐澳뽲떂預э슭[^ 111010011011000010110010111011111011111010010100111001111111111010010110111011101000101110011000111001111110100010101100111011111011110110111110010110111110100110110000101100101110111110111110100101001110011111111110100101101110111010001011100110001110011111101000101011001110111110111101101111100101101101011110 e9b0b2efbe94e7fe96ee8b98e7e8acefbdbe5be9b0b2efbe94e7fe96ee8b98e7e8acefbdbe5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)