To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 業?????袁≪?娃??徇??音≪???┼ 1000101111000110001111110011111100111111001111110011111111100101110011011000000111100001001111111000100010100001001111110011111110011100011011010011111100111111100010011011100110000001111000010011111100111111001111111000010010101001 8bc63f3f3f3f3fe5cd81e13f88a13f3f9c6d3f3f89b981e13f3f3f84a9
EUC-JP 業?????袁≪?娃??徇??音≪?彛?┼ 10110110110010000011111100111111001111110011111100111111111010101100111110100010111000110011111110110000101000110011111100111111110101111100111000111111001111111011001010111011101000101110001100111111100011111011110011111010001111111010100010101011 b6c83f3f3f3f3feacfa2e33fb0a33f3fd7ce3f3fb2bba2e33f8fbcfa3fa8ab
UTF-8 業삳돆杻앲짆袁≪뒴娃븍툖徇띺샒音≪궡彛뽳┼ 111001101010010110101101111011001000001010110011111010111000111110000110111011111010011110001000111011001001010110110010111011001010011110000110111010001010001010000001111000101000100110101010111010111001001010110100111001011010100010000011111010111011100010001101111011011000100010010110111001011011111010000111111010111001110110111010111011001000001110010010111010011001111110110011111000101000100110101010111010101011011010100001111001011011110110011011111010111011110110110011111000101001010010111100 e6a5adec82b3eb8f86efa788ec95b2eca786e8a281e289aaeb92b4e5a883ebb88ded8896e5be87eb9dbaec8392e99fb3e289aaeab6a1e5bd9bebbdb3e294bc
UHC 業삳돆杻앲짆袁≪뒴娃븍툖徇띺샒音≪궡彛뽳┼ 111001011111011010111011111010111000100110010111111010101111010010011101111010001010001110010101111010101011111010100001111011001000101010101101111010001101111110111010111010111011100010001101111000101101111110001101111010011001100010111111111010111110010110100001111011001000001010110100111011001010110110010110111011111010011010101011 e5f6bbeb8997eaf49de8a395eabea1ec8aade8dfbaebb88de2df8de998bfebe5a1ec82b4ecad96efa6ab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)