To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 也?????嗚??}也?????嗚??{^ 10010110111001110011111100111111001111110011111100111111100110100110101000111111001111110111110110010110111001110011111100111111001111110011111100111111100110100110101000111111001111110111101101011110 96e73f3f3f3f3f9a6a3f3f7d96e73f3f3f3f3f9a6a3f3f7b5e
EUC-JP 也?????嗚??}也?????嗚??{^ 11001100111010010011111100111111001111110011111100111111110100111100101100111111001111110111110111001100111010010011111100111111001111110011111100111111110100111100101100111111001111110111101101011110 cce93f3f3f3f3fd3cb3f3f7dcce93f3f3f3f3fd3cb3f3f7b5e
UTF-8 也썹즾溜곕젍嗚몃젧}也썹즾溜곕젍嗚몃젧{^ 111001001011100110011111111011001000110110111001111011001010011010111110111011111010011110001011111010101011001110010101111011001010000010001101111001011001011110011010111010111010101010000011111011001010000010100111011111011110010010111001100111111110110010001101101110011110110010100110101111101110111110100111100010111110101010110011100101011110110010100000100011011110010110010111100110101110101110101010100000111110110010100000101001110111101101011110 e4b99fec8db9eca6beefa78beab395eca08de5979aebaa83eca0a77de4b99fec8db9eca6beefa78beab395eca08de5979aebaa83eca0a77b5e
UHC 也썹즾溜곕젍嗚몃젧}也썹즾溜곕젍嗚몃젧{^ 111001011010010110111101111001111010001110010000111010101111111010110000111010111010000010001110111001111111000010111000111010111010000010011111011111011110010110100101101111011110011110100011100100001110101011111110101100001110101110100000100011101110011111110000101110001110101110100000100111110111101101011110 e5a5bde7a390eafeb0eba08ee7f0b8eba09f7de5a5bde7a390eafeb0eba08ee7f0b8eba09f7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)