To which bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?暴??咐げ??韓枇???割荊???白^ 00111111100101100101110000111111001111111001100111110011100000101011000000111111001111111000101011011000100101001111100000111111001111110011111110001010100001001000110001110100001111110011111100111111100101001001001001011110 3f965c3f3f99f382b03f3f8ad894f83f3f3f8a848c743f3f3f94925e
EUC-JP ?暴??咐げ??韓枇???割荊???白^ 00111111110010111011110100111111001111111101001011110101101001001011001000111111001111111011010011011010110010001111101000111111001111110011111110110011111001001011011111010101001111110011111100111111110001111111001001011110 3fcbbd3f3fd2f5a4b23f3fb4dac8fa3f3f3fb3e4b7d53f3f3fc7f25e
UTF-8 뤋暴쭗샘咐げ렗뤋韓枇샘렒뤋割荊쾸쵍샘白^ 11101011101001001000101111100110100110101011010011101100101011011001011111101100100000111001100011100101100100101001000011100011100000011001001011101011101000001001011111101011101001001000101111101001100111111001001111100110100111101000011111101100100000111001100011101011101000001001001011101011101001001000101111100101100010011011001011101000100011011000101011101100101111101011100011101100101101011000110111101100100000111001100011100111100110011011110101011110 eba48be69ab4ecad97ec8398e59290e38192eba097eba48be99f93e69e87ec8398eba092eba48be589b2e88d8aecbeb8ecb58dec8398e799bd5e
UHC 뤋暴쭗샘咐げ렗뤋韓枇샘렒뤋割荊쾸쵍샘白^ 100011111011101111111000111011001010011110001111101110111111100111011100111110111010101010110010100011101010110010001111101110111111100111011011110111011110110110111011111110011000111010100111100011111011101111111001110111001111101110101010101100101000111010101100100011111011101111111001110110111101110001011110 8fbbf8eca78fbbf9dcfbaab28eac8fbbf9dbddedbbf98ea78fbbf9dcfbaab28eac8fbbf9dbdc5e

SJIS-Win, EUC-JP: legacy charsets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
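
The kind of conversion shown in the table above can be approximated with a short Python sketch. This is not the tool's own implementation. The function name encode_table and the mapping from the charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions made for illustration, and Python's conversion tables may differ from the tool's in edge cases. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be how the table above handles them as well.

```python
# Hypothetical helper: encode a string in several charsets and report
# the resulting bytes as a binary string and as a hexadecimal string.

# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return a list of (charset label, binary string, hex string) rows."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("白^"):
        print(label, bits, hex_string)
```

For the input "白^", the UTF-8 row of this sketch gives e799bd5e and the ISO-8859-1 row gives 3f5e, which matches the trailing bytes of the corresponding rows in the table above.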