To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 要ょ?節ワ?歪??要ょ?節ワ?歪??^ 1001011101110110100000101110010100111111100100001101111110000011100011110011111110011000011000110011111100111111100101110111011010000010111001010011111110010000110111111000001110001111001111111001100001100011001111110011111101011110 977682e53f90df838f3f98633f3f977682e53f90df838f3f98633f3f5e
EUC-JP 要ょ?節ワ?歪??要ょ?節ワ?歪??^ 1100110111010111101001001110011100111111110000001110000110100101111011110011111111001111110001000011111100111111110011011101011110100100111001110011111111000000111000011010010111101111001111111100111111000100001111110011111101011110 cdd7a4e73fc0e1a5ef3fcfc43f3fcdd7a4e73fc0e1a5ef3fcfc43f3f5e
UTF-8 要ょ몱節ワ슭歪됵쉑要ょ몱節ワ슭歪됵쉑^ 11101000101001101000000111100011100000101000011111101011101010101011000111100111101011111000000011100011100000111010111111101100100010101010110111100110101011011010101011101011100100001011010111101100100010011001000111101000101001101000000111100011100000101000011111101011101010101011000111100111101011111000000011100011100000111010111111101100100010101010110111100110101011011010101011101011100100001011010111101100100010011001000101011110 e8a681e38287ebaab1e7af80e383afec8aade6adaaeb90b5ec8991e8a681e38287ebaab1e7af80e383afec8aade6adaaeb90b5ec89915e
UHC 要ょ몱節ワ슭歪됵쉑要ょ몱節ワ슭歪됵쉑^ 11101001101010011010101011100111100100011001101011101111101111011010101111101111101111011011111011101000111000001000100111101111101111011010011111101001101010011010101011100111100100011001101011101111101111011010101111101111101111011011111011101000111000001000100111101111101111011010011101011110 e9a9aae7919aefbdabefbdbee8e089efbda7e9a9aae7919aefbdabefbdbee8e089efbda75e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)