To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN 也?????釉??z也?????釉??zB 10010110111001110011111100111111001111110011111100111111111001111101011000111111001111110111101010010110111001110011111100111111001111110011111100111111111001111101011000111111001111110111101001000010 96e73f3f3f3f3fe7d63f3f7a96e73f3f3f3f3fe7d63f3f7a42
EUC-JP 也?????釉??z也?????釉??zB 11001100111010010011111100111111001111110011111100111111111011101101100000111111001111110111101011001100111010010011111100111111001111110011111100111111111011101101100000111111001111110111101001000010 cce93f3f3f3f3feed83f3f7acce93f3f3f3f3feed83f3f7a42
UTF-8 也㏓봺栒듿죰釉낃텞z也㏓봺栒듿죰釉낃텞zB 111001001011100110011111111000111000111110010011111010111011010010111010111001101010000010010010111010111001001110111111111011001010001110110000111010011000011110001001111010111000001010000011111011011000010110011110011110101110010010111001100111111110001110001111100100111110101110110100101110101110011010100000100100101110101110010011101111111110110010100011101100001110100110000111100010011110101110000010100000111110110110000101100111100111101001000010 e4b99fe38f93ebb4bae6a092eb93bfeca3b0e98789eb8283ed859e7ae4b99fe38f93ebb4bae6a092eb93bfeca3b0e98789eb8283ed859e7a42
UHC 也㏓봺栒듿죰釉낃텞z也㏓봺栒듿죰釉낃텞zB 111001011010010110100111111010111001010010000001111000101110001110001010111001011010000110001011111010111011100010000101111010101011011010010101011110101110010110100101101001111110101110010100100000011110001011100011100010101110010110100001100010111110101110111000100001011110101010110110100101010111101001000010 e5a5a7eb9481e2e38ae5a18bebb885eab6957ae5a5a7eb9481e2e38ae5a18bebb885eab6957a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)