What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???幼??而h?抑??搖?????塢??^ 00111111001111110011111110010111011000110011111100111111100011101010011110000010100010000011111110010111011111010011111100111111100111011000101000111111001111110011111100111111001111111001101011000111001111110011111101011110 3f3f3f97633f3f8ea782883f977d3f3f9d8a3f3f3f3f3f9ac73f3f5e
EUC-JP ???幼??而h?抑??搖?????塢??^ 00111111001111110011111111001101110001000011111100111111101111001010100110100011111010000011111111001101110111100011111100111111110110011110101000111111001111110011111100111111001111111101010011001001001111110011111101011110 3f3f3fcdc43f3fbca9a3e83fcdde3f3fd9ea3f3f3f3f3fd4c93f3f5e
UTF-8 琉딃뻸幼볠셀而h뱫抑먮쨰搖껊뙎轢우빰塢묉넀^ 11101111101001111000110011101011100101001000001111101011101110111011100011100101101110011011110011101011101100111010000011101100100001011000000011101000100000001000110011101111101111011000100011101011101100011010101111100110100010101001000111101011101010001010111011101100101010001011000011100110100100001001011011101010101110111000101011101011100110011000111011101111101001101000110111101100100110101011000011101011101110011011000011100101101000011010001011101011101011001000100111101011100001001000000001011110 efa78ceb9483ebbbb8e5b9bcebb3a0ec8580e8808cefbd88ebb1abe68a91eba8aeeca8b0e69096eabb8aeb998eefa68dec9ab0ebb9b0e5a1a2ebac89eb84805e
UHC 琉딃뻸幼볠셀而h뱫抑먮쨰搖껊뙎轢우빰塢묉넀^ 11101011101001001000101011101001100101101000001111101010111010101001001111100110101111001011111111101100101110111010001111101000100100111001000111100101111001001001000011101011101001001000101011101000111101001000001111101011100011001001001111100110101111001011111111101100101110111010001111100111111100011001000111100110100001101001000001011110 eba48ae99683eaea93e6bcbfecbba3e89391e5e490eba48ae8f483eb8c93e6bcbfecbba3e7f191e686905e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
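
The conversion shown in the table can be approximated with a few lines of Python. This is a minimal sketch, not the code behind this page: the function names `encode_to_bits` and `convert` are hypothetical. The mapping from the table's charset labels to Python codec names is an assumption (`cp932` for SJIS-WIN, `cp949` for UHC). Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is assumed to match the `3f` runs in the table.

```python
# Assumed mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_to_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexa) in convert("幼^").items():
        print(label, binary, hexa)
```

For example, `encode_to_bits("幼", "cp932")` returns `('1001011101100011', '9763')`, which agrees with the bytes `9763` in the SJIS-WIN row above.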