What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淨?源?虞???制絅?魄淨?源?虞???制絅?白^ 10011111110001000011111110001100101110010011111110001011111100010011111100111111001111111001000010100111111000110100010000111111111010011010111010011111110001000011111110001100101110010011111110001011111100010011111100111111001111111001000010100111111000110100010000111111100101001001001001011110 9fc43f8cb93f8bf13f3f3f90a7e3443fe9ae9fc43f8cb93f8bf13f3f3f90a7e3443f94925e
EUC-JP 淨?源?虞???制絅?魄淨?源?虞???制絅?白^ 11011110110001100011111110111000101110110011111110110110111100110011111100111111001111111100000010101001111001011010010100111111111100101011000011011110110001100011111110111000101110110011111110110110111100110011111100111111001111111100000010101001111001011010010100111111110001111111001001011110 dec63fb8bb3fb6f33f3f3fc0a9e5a53ff2b0dec63fb8bb3fb6f33f3f3fc0a9e5a53fc7f25e
UTF-8 淨렠源렰虞렧欌렪制絅렠魄淨렠源렰虞렧欌렪制絅렠白^ 11100110101101111010100011101011101000001010000011100110101110101001000011101011101000001011000011101000100110011001111011101011101000001010011111100110101011001000110011101011101000001010101011100101100010001011011011100111101101011000010111101011101000001010000011101001101011011000010011100110101101111010100011101011101000001010000011100110101110101001000011101011101000001011000011101000100110011001111011101011101000001010011111100110101011001000110011101011101000001010101011100101100010001011011011100111101101011000010111101011101000001010000011100111100110011011110101011110 e6b7a8eba0a0e6ba90eba0b0e8999eeba0a7e6ac8ceba0aae588b6e7b585eba0a0e9ad84e6b7a8eba0a0e6ba90eba0b0e8999eeba0a7e6ac8ceba0aae588b6e7b585eba0a0e799bd5e
UHC 淨렠源렰虞렧欌렪制絅렠魄淨렠源렰虞렧欌렪制絅렠白^ 11101111111001001000111010110001111010101011100110001110101111011110100111100101100011101011011011101101111010111000111010111000111100001010010011001100111001111000111010110001110110111101111011101111111001001000111010110001111010101011100110001110101111011110100111100101100011101011011011101101111010111000111010111000111100001010010011001100111001111000111010110001110110111101110001011110 efe48eb1eab98ebde9e58eb6edeb8eb8f0a4cce78eb1dbdeefe48eb1eab98ebde9e58eb6edeb8eb8f0a4cce78eb1dbdc5e
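The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper names `encode_row` and `convert` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset of the table, keyed by charset label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("白^").items():
        print(label, bits, hex_string)
```

Running it on the last two characters of the sample input, `白^`, should give the same trailing bytes as each row of the table, for example `e799bd5e` for UTF-8 and `94925e` for SJIS-WIN.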

SJIS-Win, EUC-JP: Legacy character sets mainly used for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
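To illustrate how the two Japanese charsets differ, here is a small Python sketch. It assumes that Python's `cp932` and `euc_jp` codecs correspond to SJIS-Win and EUC-JP, and the helper name `round_trip` is hypothetical. The same character gets different bytes in each charset, and those bytes only decode back correctly with the matching charset.

```python
def round_trip(char, encode_with, decode_with):
    """Encode char with one codec, then decode the bytes with another."""
    data = char.encode(encode_with)
    # errors="replace" yields U+FFFD for bytes the decoder cannot interpret.
    return data, data.decode(decode_with, errors="replace")


if __name__ == "__main__":
    # Same codec on both sides: the character survives.
    print(round_trip("白", "cp932", "cp932"))
    print(round_trip("白", "euc_jp", "euc_jp"))
    # Mismatched codecs: the bytes no longer decode to the original character.
    print(round_trip("白", "cp932", "euc_jp"))
```

A mismatch like the last call is a common cause of garbled text when Windows and UNIX systems exchange Japanese files without declaring the charset.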