What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 永??飮??諭??茹l?永??飮??諭??茹l?B 1000100101101001001111110011111110011111010110100011111100111111100101110100000000111111001111111110010010100101100000101000110000111111100010010110100100111111001111111001111101011010001111110011111110010111010000000011111100111111111001001010010110000010100011000011111101000010 89693f3f9f5a3f3f97403f3fe4a5828c3f89693f3f9f5a3f3f97403f3fe4a5828c3f42
EUC-JP 永??飮??諭??茹l?永??飮??諭??茹l?B 1011000111001010001111110011111111011101101110110011111100111111110011011010000100111111001111111110100010100111101000111110110000111111101100011100101000111111001111111101110110111011001111110011111111001101101000010011111100111111111010001010011110100011111011000011111101000010 b1ca3f3fddbb3f3fcda13f3fe8a7a3ec3fb1ca3f3fddbb3f3fcda13f3fe8a7a3ec3f42
UTF-8 永띠뮋飮뀐쬄諭꾪뮏茹l텥永띠뮋飮뀐쬄諭꾪뮏茹l텥B 11100110101100001011100011101011100111011010000011101011101011101000101111101001101000111010111011101011100000001001000011101100101011001000010011101000101010111010110111101010101111101010101011101011101011101000111111101000100011001011100111101111101111011000110011101101100001011010010111100110101100001011100011101011100111011010000011101011101011101000101111101001101000111010111011101011100000001001000011101100101011001000010011101000101010111010110111101010101111101010101011101011101011101000111111101000100011001011100111101111101111011000110011101101100001011010010101000010 e6b0b8eb9da0ebae8be9a3aeeb8090ecac84e8abadeabeaaebae8fe88cb9efbd8ced85a5e6b0b8eb9da0ebae8be9a3aeeb8090ecac84e8abadeabeaaebae8fe88cb9efbd8ced85a542
UHC 永띠뮋飮뀐쬄諭꾪뮏茹l텥永띠뮋飮뀐쬄諭꾪뮏茹l텥B 11100111101101011011011011101100100100101001100111101011111001101011001011101111101001101001101111101011101100011000010011101101100100101001110011100110101010101010001111101100101101101001101011100111101101011011011011101100100100101001100111101011111001101011001011101111101001101001101111101011101100011000010011101101100100101001110011100110101010101010001111101100101101101001101001000010 e7b5b6ec9299ebe6b2efa69bebb184ed929ce6aaa3ecb69ae7b5b6ec9299ebe6b2efa69bebb184ed929ce6aaa3ecb69a42
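
A table like the one above can be approximated in a few lines of Python. This is only a sketch: the converter's own implementation is not shown here, so the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f) by Python's `replace` error handler, which may differ in detail from how the converter substitutes them.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("永B").items():
        print(label, bits, hex_string)
```

For the input `永B`, the UTF-8 row comes out as `e6b0b842`, which matches the first three bytes and the final byte of the UTF-8 row in the table.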

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
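
Because both charsets cover the same Japanese characters with different byte values, text can be moved between them by decoding and re-encoding. The snippet below is a minimal sketch that assumes Python's `cp932` and `euc_jp` codecs correspond to the SJIS-Win and EUC-JP rows above; the function name `sjis_win_to_euc_jp` is made up for this example.

```python
def sjis_win_to_euc_jp(data):
    """Transcode bytes from SJIS-Win (CP932) to EUC-JP via Unicode."""
    text = data.decode("cp932")   # bytes -> Unicode string
    return text.encode("euc_jp")  # Unicode string -> EUC-JP bytes


if __name__ == "__main__":
    # 0x89 0x69 is the character 永 in SJIS-Win.
    print(sjis_win_to_euc_jp(b"\x89\x69").hex())
```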