To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 霑ェ譟辛霑ェ譟診N}霑ェ譟辛霑ェ譟診N{^ 111010001011111110101010111001101001111110010000011010001110100010111111101010101110011010011111100100000110011001001110011111011110100010111111101010101110011010011111100100000110100011101000101111111010101011100110100111111001000001100110010011100111101101011110 e8bfaae69f9068e8bfaae69f90664e7de8bfaae69f9068e8bfaae69f90664e7b5e
EUC-JP 霑ェ譟辛霑ェ譟診N}霑ェ譟辛霑ェ譟診N{^ 11110000110000011000111010101010111011001010000110111111110010011111000011000001100011101010101011101100101000011011111111000111010011100111110111110000110000011000111010101010111011001010000110111111110010011111000011000001100011101010101011101100101000011011111111000111010011100111101101011110 f0c18eaaeca1bfc9f0c18eaaeca1bfc74e7df0c18eaaeca1bfc9f0c18eaaeca1bfc74e7b5e
UTF-8 霑ェ譟辛霑ェ譟診N}霑ェ譟辛霑ェ譟診N{^ 1110100110011100100100011110111110111101101010101110100010101101100111111110100010111110100110111110100110011100100100011110111110111101101010101110100010101101100111111110100010101000101110100100111001111101111010011001110010010001111011111011110110101010111010001010110110011111111010001011111010011011111010011001110010010001111011111011110110101010111010001010110110011111111010001010100010111010010011100111101101011110 e99c91efbdaae8ad9fe8be9be99c91efbdaae8ad9fe8a8ba4e7de99c91efbdaae8ad9fe8be9be99c91efbdaae8ad9fe8a8ba4e7b5e
UHC 霑??辛霑??診N}霑??辛霑??診N{^ 1110111111000101001111110011111111100011111101001110111111000101001111110011111111110010111000000100111001111101111011111100010100111111001111111110001111110100111011111100010100111111001111111111001011100000010011100111101101011110 efc53f3fe3f4efc53f3ff2e04e7defc53f3fe3f4efc53f3ff2e04e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)