To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 雖臥ク杆雖臥ク曷n}雖臥ク杆雖臥ク曷n{^ 111001011010101110001001111001111011100010011110010101111110010110101011100010011110011110111000100111100100101001101110011111011110010110101011100010011110011110111000100111100101011111100101101010111000100111100111101110001001111001001010011011100111101101011110 e5ab89e7b89e57e5ab89e7b89e4a6e7de5ab89e7b89e57e5ab89e7b89e4a6e7b5e
EUC-JP 雖臥ク杆雖臥ク曷n}雖臥ク杆雖臥ク曷n{^ 11101010101011011011001011101001100011101011100011011011101110001110101010101101101100101110100110001110101110001101101110101011011011100111110111101010101011011011001011101001100011101011100011011011101110001110101010101101101100101110100110001110101110001101101110101011011011100111101101011110 eaadb2e98eb8dbb8eaadb2e98eb8dbab6e7deaadb2e98eb8dbb8eaadb2e98eb8dbab6e7b5e
UTF-8 雖臥ク杆雖臥ク曷n}雖臥ク杆雖臥ク曷n{^ 1110100110011011100101101110100010000111101001011110111110111101101110001110011010011101100001101110100110011011100101101110100010000111101001011110111110111101101110001110011010011011101101110110111001111101111010011001101110010110111010001000011110100101111011111011110110111000111001101001110110000110111010011001101110010110111010001000011110100101111011111011110110111000111001101001101110110111011011100111101101011110 e99b96e887a5efbdb8e69d86e99b96e887a5efbdb8e69bb76e7de99b96e887a5efbdb8e69d86e99b96e887a5efbdb8e69bb76e7b5e
UHC 雖臥?杆雖臥?曷n}雖臥?杆雖臥?曷n{^ 111000101100110011101000110000100011111111001010110100101110001011001100111010001100001000111111110010101110001101101110011111011110001011001100111010001100001000111111110010101101001011100010110011001110100011000010001111111100101011100011011011100111101101011110 e2cce8c23fcad2e2cce8c23fcae36e7de2cce8c23fcad2e2cce8c23fcae36e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)