What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????~????????? 00111111001111110011111100111111001111110011111100111111001111110011111101111110001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f7e3f3f3f3f3f3f3f3f3f
SJIS-WIN 羽?億???倚輕炳~羽?億???倚輕炳 1000100101001000001111111000100110101101001111110011111100111111100110001101111111100111011010101110000001111010011111101000100101001000001111111000100110101101001111110011111100111111100110001101111111100111011010101110000001111010 89483f89ad3f3f3f98dfe76ae07a7e89483f89ad3f3f3f98dfe76ae07a
EUC-JP 羽?億???倚輕炳~羽?億???倚輕炳 1011000110101001001111111011001010101111001111110011111100111111110100001110000111101101110010111101111111011011011111101011000110101001001111111011001010101111001111110011111100111111110100001110000111101101110010111101111111011011 b1a93fb2af3f3f3fd0e1edcbdfdb7eb1a93fb2af3f3f3fd0e1edcbdfdb
UTF-8 羽렡億띨렠렲倚輕炳~羽렡億띨렠렲倚輕炳 11100111101111101011110111101011101000001010000111100101100001001000010011101011100111011010100011101011101000001010000011101011101000001011001011100101100000001001101011101000101111001001010111100111100000101011001101111110111001111011111010111101111010111010000010100001111001011000010010000100111010111001110110101000111010111010000010100000111010111010000010110010111001011000000010011010111010001011110010010101111001111000001010110011 e7bebdeba0a1e58484eb9da8eba0a0eba0b2e5809ae8bc95e782b37ee7bebdeba0a1e58484eb9da8eba0a0eba0b2e5809ae8bc95e782b3
UHC 羽렡億띨렠렲倚輕炳~羽렡億띨렠렲倚輕炳 11101001111000101000111010110010111001011110001010110110111011101000111010110001100011101011111111101011111011111100110011101110110111001011100101111110111010011110001010001110101100101110010111100010101101101110111010001110101100011000111010111111111010111110111111001100111011101101110010111001 e9e28eb2e5e2b6ee8eb18ebfebefcceedcb97ee9e28eb2e5e2b6ee8eb18ebfebefcceedcb9

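SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (hexadecimal 3f) in the table above.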
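
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's actual implementation. The codec names are assumptions about how the table's charsets map to Python's standard codecs: cp932 for SJIS-WIN follows the note above, and cp949 for UHC is a guess. The helper names encode_row and encode_table are hypothetical. Unencodable characters are replaced with "?", which matches the 3f bytes seen in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# "cp949" for UHC is an assumption, not taken from the source.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (hex string, binary string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return raw.hex(), binary


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (hex_str, bin_str) in encode_table("羽~").items():
        print(label, bin_str, hex_str)
```

Calling encode_row once per charset keeps each row independent, so a character that one charset cannot represent only affects that charset's row.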