To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????TB 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101010001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5442
SJIS-WIN ????????????檍????????蚣TB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111001111011111000001111110011111100111111001111110011111100111111001111110011111111100101011011100101010001000010 3f3f3f3f3f3f3f3f3f3f3f3f9ef83f3f3f3f3f3f3f3fe56e5442
EUC-JP ????????????檍????????蚣TB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111101110011111010001111110011111100111111001111110011111100111111001111110011111111101001110011110101010001000010 3f3f3f3f3f3f3f3f3f3f3f3fdcfa3f3f3f3f3f3f3f3fe9cf5442
UTF-8 溜삳젗溜브세溜깅졎溜뽯졋檍껊졁溜삳젚溜뷸뇘蚣TB 1110111110100111100010111110110010000010101100111110110010100000100101111110111110100111100010111110101110111000100011001110110010000100101110001110111110100111100010111110101010111001100001011110110010100001100011101110111110100111100010111110101110111101101011111110110010100001100010111110011010101010100011011110101010111011100010101110110010100001100000011110111110100111100010111110110010000010101100111110110010100000100110101110111110100111100010111110101110110111101110001110101110000111100110001110100010011010101000110101010001000010 efa78bec82b3eca097efa78bebb88cec84b8efa78beab985eca18eefa78bebbdafeca18be6aa8deabb8aeca181efa78bec82b3eca09aefa78bebb7b8eb8798e89aa35442
UHC 溜삳젗溜브세溜깅졎溜뽯졋檍껊졁溜삳젚溜뷸뇘蚣TB 11101010111111101011101111101011101000001001001111101010111111101011101011101010101111001011110011101010111111101011000111101011101000001011101111101010111111101001011011101011101000001011101011100101111001011000001111101011101000001011001011101010111111101011101111101011101000001001011011101010111111101011101011100110100001111000001111001101111101110101010001000010 eafebbeba093eafebaeabcbceafeb1eba0bbeafe96eba0bae5e583eba0b2eafebbeba096eafebae68783cdf75442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)