To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 業?????淫??歪?9異??碎l?佯 100010111100011000111111001111110011111100111111001111111000100011111010001111110011111110011000011000110011111110000010010110001000100011011001001111110011111111100001111010101000001010001100001111111001100011010001 8bc63f3f3f3f3f88fa3f3f98633f825888d93f3fe1ea828c3f98d1
EUC-JP 業?????淫??歪?9異??碎l?佯 101101101100100000111111001111110011111100111111001111111011000011111100001111110011111111001111110001000011111110100011101110011011000011011011001111110011111111100010111011001010001111101100001111111101000011010011 b6c83f3f3f3f3fb0fc3f3fcfc43fa3b9b0db3f3fe2eca3ec3fd0d3
UTF-8 業삳돆杻쒐쳥淫볦춹歪묐9異녘첑碎l삀佯 111001101010010110101101111011001000001010110011111010111000111110000110111011111010011110001000111011001001001010010000111011001011001110100101111001101011011110101011111010111011001110100110111011001011011010111001111001101010110110101010111010111010110010010000111011111011110010011001111001111001010110110000111010111000010110011000111011001011001010010001111001111010001010001110111011111011110110001100111011001000001010000000111001001011110110101111 e6a5adec82b3eb8f86efa788ec9290ecb3a5e6b7abebb3a6ecb6b9e6adaaebac90efbc99e795b0eb8598ecb291e7a28eefbd8cec8280e4bdaf
UHC 業삳돆杻쒐쳥淫볦춹歪묐9異녘첑碎l삀佯 1110010111110110101110111110101110001001100101111110101011110100100111001110011110101011100010101110101111100010100100111110110010101101100101011110100011100000100100011110101110100011101110011110110010110110101100111110100010101010100111101110000111101111101000111110110010011000100001111110010110111010 e5f6bbeb8997eaf49ce7ab8aebe293ecad95e8e091eba3b9ecb6b3e8aa9ee1efa3ec9887e5ba

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)