To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 伊頭??寃??怨封?經?頭??寃????? 1000100011001001100100111010101000111111001111111001101110000011001111110011111110001001100001011001010110010101001111111110001101010011001111111001001110101010001111110011111110011011100000110011111100111111001111110011111100111111 88c993aa3f3f9b833f3f898595953fe3533f93aa3f3f9b833f3f3f3f3f
EUC-JP 伊頭??寃??怨封?經?頭??寃????? 1011000011001011110001101010110000111111001111111101010111100011001111110011111110110001111001011100100111110101001111111110010110110100001111111100011010101100001111110011111111010101111000110011111100111111001111110011111100111111 b0cbc6ac3f3fd5e33f3fb1e5c9f53fe5b43fc6ac3f3fd5e33f3f3f3f3f
UTF-8 伊頭렧렮寃닻떵怨封렮經렍頭렧렮寃닻떵柳양볕 111001001011110010001010111010011010000010101101111010111010000010100111111010111010000010101110111001011010111110000011111010111000101110111011111010111001011010110101111001101000000010101000111001011011000010000001111010111010000010101110111001111011011010010011111010111010000010001101111010011010000010101101111010111010000010100111111010111010000010101110111001011010111110000011111010111000101110111011111010111001011010110101111011111010011110001001111011001001011010010001111010111011001110010101 e4bc8ae9a0adeba0a7eba0aee5af83eb8bbbeb96b5e680a8e5b081eba0aee7b693eba08de9a0adeba0a7eba0aee5af83eb8bbbeb96b5efa789ec9691ebb395
UHC 伊頭렧렮寃닻떵怨封렮經렍頭렧렮寃닻떵柳양볕 111011001010010111010100111010011000111010110110100011101011101111101010101100101011010011101001101101101011101011101010101100111101110011100110100011101011101111001100111010001000111010100011110101001110100110001110101101101000111010111011111010101011001010110100111010011011011010111010111010101111011110111110111001111011101010110101 eca5d4e98eb68ebbeab2b4e9b6baeab3dce68ebbcce88ea3d4e98eb68ebbeab2b4e9b6baeaf7bee7bab5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)