To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 業?????袁≪?癰??異??音〓?苑 100010111100011000111111001111110011111100111111001111111110010111001101100000011110000100111111111000011001111000111111001111111000100011011001001111110011111110001001101110011000000110101100001111111000100110010001 8bc63f3f3f3f3fe5cd81e13fe19e3f3f88d93f3f89b981ac3f8991
EUC-JP 業?????袁≪?癰??異??音〓?苑 101101101100100000111111001111110011111100111111001111111110101011001111101000101110001100111111111000011111111000111111001111111011000011011011001111110011111110110010101110111010001010101110001111111011000111110001 b6c83f3f3f3f3feacfa2e33fe1fe3f3fb0db3f3fb2bba2ae3fb1f1
UTF-8 業삳돆杻앲짆袁≪뒴癰귘뫗異녜샒音〓툙苑 111001101010010110101101111011001000001010110011111010111000111110000110111011111010011110001000111011001001010110110010111011001010011110000110111010001010001010000001111000101000100110101010111010111001001010110100111001111001100110110000111010101011011110011000111010111010101110010111111001111001010110110000111010111000010110011100111011001000001110010010111010011001111110110011111000111000000010010011111011011000100010011001111010001000101110010001 e6a5adec82b3eb8f86efa788ec95b2eca786e8a281e289aaeb92b4e799b0eab798ebab97e795b0eb859cec8392e99fb3e38093ed8899e88b91
UHC 業삳돆杻앲짆袁≪뒴癰귘뫗異녜샒音〓툙苑 1110010111110110101110111110101110001001100101111110101011110100100111011110100010100011100101011110101010111110101000011110110010001010101011011110100010111001100000101110001010010001101110011110110010110110101100111110100110011000101111111110101111100101101000011110101110111000100100001110101010111101 e5f6bbeb8997eaf49de8a395eabea1ec8aade8b982e291b9ecb6b3e998bfebe5a1ebb890eabd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)