To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 馭??揖?筌≪?筍?馭??揖?筌≪?筍?B 11101001011001100011111100111111100101110100101100111111111000101010001110000001111000010011111111100010101000010011111111101001011001100011111100111111100101110100101100111111111000101010001110000001111000010011111111100010101000010011111101000010 e9663f3f974b3fe2a381e13fe2a13fe9663f3f974b3fe2a381e13fe2a13f42
EUC-JP 馭??揖?筌≪?筍?馭??揖?筌≪?筍?B 11110001110001110011111100111111110011011010110000111111111001001010010110100010111000110011111111100100101000110011111111110001110001110011111100111111110011011010110000111111111001001010010110100010111000110011111111100100101000110011111101000010 f1c73f3fcdac3fe4a5a2e33fe4a33ff1c73f3fcdac3fe4a5a2e33fe4a33f42
UTF-8 馭곴퀬揖췇筌≪뼚筍쉡馭곴퀬揖췇筌≪뼚筍쉡B 11101001101001101010110111101010101100111011010011101101100000001010110011100110100011111001011011101100101101111000011111100111101011011000110011100010100010011010101011101011101111001001101011100111101011011000110111101100100010011010000111101001101001101010110111101010101100111011010011101101100000001010110011100110100011111001011011101100101101111000011111100111101011011000110011100010100010011010101011101011101111001001101011100111101011011000110111101100100010011010000101000010 e9a6adeab3b4ed80ace68f96ecb787e7ad8ce289aaebbc9ae7ad8dec89a1e9a6adeab3b4ed80ace68f96ecb787e7ad8ce289aaebbc9ae7ad8dec89a142
UHC 馭곴퀬揖췇筌≪뼚筍쉡馭곴퀬揖췇筌≪뼚筍쉡B 1110010111011111100000011110101010110011101000001110101111100111101011100100001011101111101001111010000111101100100101101010000011100010111011001001101001110110111001011101111110000001111010101011001110100000111010111110011110101110010000101110111110100111101000011110110010010110101000001110001011101100100110100111011001000010 e5df81eab3a0ebe7ae42efa7a1ec96a0e2ec9a76e5df81eab3a0ebe7ae42efa7a1ec96a0e2ec9a7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)