To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 畑??飮?Ⅴ揄??D畑??飮?Ⅴ揄??D^ 1001010010101000001111110011111110011111010110100011111110000111010110001001110110001001001111110011111101000100100101001010100000111111001111111001111101011010001111111000011101011000100111011000100100111111001111110100010001011110 94a83f3f9f5a3f87589d893f3f4494a83f3f9f5a3f87589d893f3f445e
EUC-JP 畑??飮??揄??D畑??飮??揄??D^ 110010001010101000111111001111111101110110111011001111110011111111011001111010010011111100111111010001001100100010101010001111110011111111011101101110110011111100111111110110011110100100111111001111110100010001011110 c8aa3f3fddbb3f3fd9e93f3f44c8aa3f3fddbb3f3fd9e93f3f445e
UTF-8 畑대벝飮깍Ⅴ揄명뫒D畑대벝飮깍Ⅴ揄명뫒D^ 111001111001010110010001111010111000110010000000111010111011001010011101111010011010001110101110111010101011100110001101111000101000010110100100111001101000111110000100111010111010101010000101111010111010101110010010010001001110011110010101100100011110101110001100100000001110101110110010100111011110100110100011101011101110101010111001100011011110001010000101101001001110011010001111100001001110101110101010100001011110101110101011100100100100010001011110 e79591eb8c80ebb29de9a3aeeab98de285a4e68f84ebaa85ebab9244e79591eb8c80ebb29de9a3aeeab98de285a4e68f84ebaa85ebab92445e
UHC 畑대벝飮깍Ⅴ揄명뫒D畑대벝飮깍Ⅴ揄명뫒D^ 111011111010010110110100111010111001001110111000111010111110011010110001111011111010010110110100111010101111000110111000111011011001000110110100010001001110111110100101101101001110101110010011101110001110101111100110101100011110111110100101101101001110101011110001101110001110110110010001101101000100010001011110 efa5b4eb93b8ebe6b1efa5b4eaf1b8ed91b444efa5b4eb93b8ebe6b1efa5b4eaf1b8ed91b4445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)