To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 搖?????梧??v搖?????梧??vB 10011101100010100011111100111111001111110011111100111111100011001110011000111111001111110111011010011101100010100011111100111111001111110011111100111111100011001110011000111111001111110111011001000010 9d8a3f3f3f3f3f8ce63f3f769d8a3f3f3f3f3f8ce63f3f7642
EUC-JP 搖?????梧??v搖?????梧??vB 11011001111010100011111100111111001111110011111100111111101110001110100000111111001111110111011011011001111010100011111100111111001111110011111100111111101110001110100000111111001111110111011001000010 d9ea3f3f3f3f3fb8e83f3f76d9ea3f3f3f3f3fb8e83f3f7642
UTF-8 搖쇽쉽樂쒙슁梧잌삻v搖쇽쉽樂쒙슁梧잌삻vB 111001101001000010010110111011001000011110111101111011001000100110111101111011111010011010111111111011001001001010011001111011001000101010000001111001101010001010100111111011001001111010001100111011001000001010111011011101101110011010010000100101101110110010000111101111011110110010001001101111011110111110100110101111111110110010010010100110011110110010001010100000011110011010100010101001111110110010011110100011001110110010000010101110110111011001000010 e69096ec87bdec89bdefa6bfec9299ec8a81e6a2a7ec9e8cec82bb76e69096ec87bdec89bdefa6bfec9299ec8a81e6a2a7ec9e8cec82bb7642
UHC 搖쇽쉽樂쒙슁梧잌삻v搖쇽쉽樂쒙슁梧잌삻vB 111010001111010010111100111011111011110110110001111010001111100110011100111011111011110110110011111001111111110010011111111001011001100010110010011101101110100011110100101111001110111110111101101100011110100011111001100111001110111110111101101100111110011111111100100111111110010110011000101100100111011001000010 e8f4bcefbdb1e8f99cefbdb3e7fc9fe598b276e8f4bcefbdb1e8f99cefbdb3e7fc9fe598b27642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)