To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 驕ョ襍ヲ驟檎、コ鬆殉驕ョ襍ヲ驟檎、コ鬆旬^ 111010011000000110101110111010001011010110100110111010011000010110001100111001111010010010111010111010011010000010001111011111011110100110000001101011101110100010110101101001101110100110000101100011001110011110100100101110101110100110100000100011110111101101011110 e981aee8b5a6e9858ce7a4bae9a08f7de981aee8b5a6e9858ce7a4bae9a08f7b5e
EUC-JP 驕ョ襍ヲ驟檎、コ鬆殉驕ョ襍ヲ驟檎、コ鬆旬^ 1111000111100001100011101010111011110000101101111000111010100110111100011110010110111000111010011000111010100100100011101011101011110010101000101011110111011110111100011110000110001110101011101111000010110111100011101010011011110001111001011011100011101001100011101010010010001110101110101111001010100010101111011101110001011110 f1e18eaef0b78ea6f1e5b8e98ea48ebaf2a2bddef1e18eaef0b78ea6f1e5b8e98ea48ebaf2a2bddc5e
UTF-8 驕ョ襍ヲ驟檎、コ鬆殉驕ョ襍ヲ驟檎、コ鬆旬^ 11101001101010011001010111101111101111011010111011101000101001011000110111101111101111011010011011101001101010011001111111100110101010101000111011101111101111011010010011101111101111011011101011101001101011001000011011100110101011101000100111101001101010011001010111101111101111011010111011101000101001011000110111101111101111011010011011101001101010011001111111100110101010101000111011101111101111011010010011101111101111011011101011101001101011001000011011100110100101111010110001011110 e9a995efbdaee8a58defbda6e9a99fe6aa8eefbda4efbdbae9ac86e6ae89e9a995efbdaee8a58defbda6e9a99fe6aa8eefbda4efbdbae9ac86e697ac5e
UHC 驕???驟檎???殉驕???驟檎???旬^ 1100111011110110001111110011111100111111111101101010111011010000110101010011111100111111001111111110001011100110110011101111011000111111001111110011111111110110101011101101000011010101001111110011111100111111111000101110001001011110 cef63f3f3ff6aed0d53f3f3fe2e6cef63f3f3ff6aed0d53f3f3fe2e25e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)