To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 上ソシァセ燻、悉邪上ソシァセ燻、悉邪B 100011111110001110111111101111001010011110111110111000001000111010100100100011101011101110001110110101111000111111100011101111111011110010100111101111101110000010001110101001001000111010111011100011101101011101000010 8fe3bfbca7bee08ea48ebb8ed78fe3bfbca7bee08ea48ebb8ed742
EUC-JP 上ソシァセ燻、悉邪上ソシァセ燻、悉邪B 10111110111001011000111010111111100011101011110010001110101001111000111010111110110111111110111010001110101001001011110010111101101111001101100110111110111001011000111010111111100011101011110010001110101001111000111010111110110111111110111010001110101001001011110010111101101111001101100101000010 bee58ebf8ebc8ea78ebedfee8ea4bcbdbcd9bee58ebf8ebc8ea78ebedfee8ea4bcbdbcd942
UTF-8 上ソシァセ燻、悉邪上ソシァセ燻、悉邪B 11100100101110001000101011101111101111011011111111101111101111011011110011101111101111011010011111101111101111011011111011100111100001111011101111101111101111011010010011100110100000101000100111101001100000101010101011100100101110001000101011101111101111011011111111101111101111011011110011101111101111011010011111101111101111011011111011100111100001111011101111101111101111011010010011100110100000101000100111101001100000101010101001000010 e4b88aefbdbfefbdbcefbda7efbdbee787bbefbda4e68289e982aae4b88aefbdbfefbdbcefbda7efbdbee787bbefbda4e68289e982aa42
UHC 上????燻?悉邪上????燻?悉邪B 110111111011111000111111001111110011111100111111111111011011100000111111111000111111101011011110111101111101111110111110001111110011111100111111001111111111110110111000001111111110001111111010110111101111011101000010 dfbe3f3f3f3ffdb83fe3fadef7dfbe3f3f3f3ffdb83fe3fadef742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)