To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????®????????®????B 00111111001111110011111100111111101011100011111100111111001111110011111100111111001111110011111100111111101011100011111100111111001111110011111101000010 3f3f3f3fae3f3f3f3f3f3f3f3fae3f3f3f3f42
SJIS-WIN 要ラ?節?К節??要ラ?節?К節??B 1001011101110110100000111000100100111111100100001101111100111111100001000100101110010000110111110011111100111111100101110111011010000011100010010011111110010000110111110011111110000100010010111001000011011111001111110011111101000010 977683893f90df3f844b90df3f3f977683893f90df3f844b90df3f3f42
EUC-JP 要ラ?節®К節??要ラ?節®К節??B 110011011101011110100101111010010011111111000000111000011000111110100010111011101010011110101100110000001110000100111111001111111100110111010111101001011110100100111111110000001110000110001111101000101110111010100111101011001100000011100001001111110011111101000010 cdd7a5e93fc0e18fa2eea7acc0e13f3fcdd7a5e93fc0e18fa2eea7acc0e13f3f42
UTF-8 要ラ뜈節®К節㎩넻要ラ뜈節®К節㎩넻B 111010001010011010000001111000111000001110101001111010111001110010001000111001111010111110000000110000101010111011010000100110101110011110101111100000001110001110001110101010011110101110000100101110111110100010100110100000011110001110000011101010011110101110011100100010001110011110101111100000001100001010101110110100001001101011100111101011111000000011100011100011101010100111101011100001001011101101000010 e8a681e383a9eb9c88e7af80c2aed09ae7af80e38ea9eb84bbe8a681e383a9eb9c88e7af80c2aed09ae7af80e38ea9eb84bb42
UHC 要ラ뜈節®К節㎩넻要ラ뜈節®К節㎩넻B 11101001101010011010101111101001100011011000101111101111101111011010001011100111101011001010110011101111101111011010011111100101100001101011010111101001101010011010101111101001100011011000101111101111101111011010001011100111101011001010110011101111101111011010011111100101100001101011010101000010 e9a9abe98d8befbda2e7acacefbda7e586b5e9a9abe98d8befbda2e7acacefbda7e586b542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)