To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 遲語ウエ奣梧怏寬ヲ遲語ウエ奣梧怏寬ヲB 11100111101011011000110011101010101100111011010011111010101000111000110011100110100111001000100111111010101010101010011011100111101011011000110011101010101100111011010011111010101000111000110011100110100111001000100111111010101010101010011001000010 e7ad8ceab3b4faa38ce69c89faaaa6e7ad8ceab3b4faa38ce69c89faaaa642
EUC-JP 遲語ウエ奣梧怏?ヲ遲語ウエ奣梧怏?ヲB 11101110101011111011100011101100100011101011001110001110101101001000111110111000111111001011100011101000110101111110100100111111100011101010011011101110101011111011100011101100100011101011001110001110101101001000111110111000111111001011100011101000110101111110100100111111100011101010011001000010 eeafb8ec8eb38eb48fb8fcb8e8d7e93f8ea6eeafb8ec8eb38eb48fb8fcb8e8d7e93f8ea642
UTF-8 遲語ウエ奣梧怏寬ヲ遲語ウエ奣梧怏寬ヲB 11101001100000011011001011101000101010101001111011101111101111011011001111101111101111011011010011100101101001011010001111100110101000101010011111100110100000001000111111100101101011111010110011101111101111011010011011101001100000011011001011101000101010101001111011101111101111011011001111101111101111011011010011100101101001011010001111100110101000101010011111100110100000001000111111100101101011111010110011101111101111011010011001000010 e981b2e8aa9eefbdb3efbdb4e5a5a3e6a2a7e6808fe5afacefbda6e981b2e8aa9eefbdb3efbdb4e5a5a3e6a2a7e6808fe5afacefbda642
UHC 遲語???梧怏寬?遲語???梧怏寬?B 1111001011000000111001011101111000111111001111110011111111100111111111001110010011101000110011101011000000111111111100101100000011100101110111100011111100111111001111111110011111111100111001001110100011001110101100000011111101000010 f2c0e5de3f3f3fe7fce4e8ceb03ff2c0e5de3f3f3fe7fce4e8ceb03f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)