What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???楯??而h?抑??恙?????塢??^ 00111111001111110011111110001111011111000011111100111111100011101010011110000010100010000011111110010111011111010011111100111111100111001001100100111111001111110011111100111111001111111001101011000111001111110011111101011110 3f3f3f8f7c3f3f8ea782883f977d3f3f9c993f3f3f3f3f9ac73f3f5e
EUC-JP ???楯??而h?抑??恙?????塢??^ 00111111001111110011111110111101110111010011111100111111101111001010100110100011111010000011111111001101110111100011111100111111110101111111100100111111001111110011111100111111001111111101010011001001001111110011111101011110 3f3f3fbddd3f3fbca9a3e83fcdde3f3fd7f93f3f3f3f3fd4c93f3f5e
UTF-8 琉싧옙楯곫셀而h뱫抑먮쩀恙썬냱轢우빰塢묉넀^ 11101111101001111000110011101100100010111010011111101100100110001001100111100110101001011010111111101010101100111010101111101100100001011000000011101000100000001000110011101111101111011000100011101011101100011010101111100110100010101001000111101011101010001010111011101100101010011000000011100110100000011001100111101100100011011010110011101011100000111011000111101111101001101000110111101100100110101011000011101011101110011011000011100101101000011010001011101011101011001000100111101011100001001000000001011110 efa78cec8ba7ec9899e6a5afeab3abec8580e8808cefbd88ebb1abe68a91eba8aeeca980e68199ec8daceb83b1efa68dec9ab0ebb9b0e5a1a2ebac89eb84805e
UHC 琉싧옙楯곫셀而h뱫抑먮쩀恙썬냱轢우빰塢묉넀^ 11101011101001001001101011100101101111111011110111100010111001001000000111100110101111001011111111101100101110111010001111101000100100111001000111100101111001001001000011101011101001001001101011100101101111111011110111100011100001101000000111100110101111001011111111101100101110111010001111100111111100011001000111100110100001101001000001011110 eba49ae5bfbde2e481e6bcbfecbba3e89391e5e490eba49ae5bfbde38681e6bcbfecbba3e7f191e686905e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
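
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's actual implementation, which is not shown here. It assumes that Python's built-in codecs `iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the table's ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8, and UHC rows, and that characters a charset cannot represent are replaced with "?" (0x3f), as the rows above suggest. The names `CHARSETS`, `encode_to_bits`, and `convert` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_to_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("楯^").items():
        print(label, binary, hexadecimal)
```

For the input "楯^", this sketch yields the hexadecimal strings `e6a5af5e` for UTF-8, `8f7c5e` for SJIS-WIN, and `bddd5e` for EUC-JP, which agree with the byte sequences for those two characters in the table above. ISO-8859-1 cannot represent "楯", so it yields `3f5e`.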