To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?割荊?????暴??咐ぇ??暴???B 0011111110001010100001001000110001110100001111110011111100111111001111110011111110010110010111000011111100111111100110011111001110000010101001010011111100111111100101100101110000111111001111110011111101000010 3f8a848c743f3f3f3f3f965c3f3f99f382a53f3f965c3f3f3f42
EUC-JP ?割荊?????暴??咐ぇ??暴???B 0011111110110011111001001011011111010101001111110011111100111111001111110011111111001011101111010011111100111111110100101111010110100100101001110011111100111111110010111011110100111111001111110011111101000010 3fb3e4b7d53f3f3f3f3fcbbd3f3fd2f5a4a73f3fcbbd3f3f3f42
UTF-8 뤋割荊쭖컦샘렒뤋暴쥚샘咐ぇ렟뤋暴쭗샘렋B 11101011101001001000101111100101100010011011001011101000100011011000101011101100101011011001011011101100101110111010011011101100100000111001100011101011101000001001001011101011101001001000101111100110100110101011010011101100101001011001101011101100100000111001100011100101100100101001000011100011100000011000011111101011101000001001111111101011101001001000101111100110100110101011010011101100101011011001011111101100100000111001100011101011101000001000101101000010 eba48be589b2e88d8aecad96ecbba6ec8398eba092eba48be69ab4eca59aec8398e59290e38187eba09feba48be69ab4ecad97ec8398eba08b42
UHC 뤋割荊쭖컦샘렒뤋暴쥚샘咐ぇ렟뤋暴쭗샘렋B 100011111011101111111001110111001111101110101010101001111000111010110000100011111011101111111001100011101010011110001111101110111111100011101100101000101000111110111011111110011101110011111011101010101010011110001110101100001000111110111011111110001110110010100111100011111011101111111001100011101010001001000010 8fbbf9dcfbaaa78eb08fbbf98ea78fbbf8eca28fbbf9dcfbaaa78eb08fbbf8eca78fbbf98ea242

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)