To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?ワ?爰??音??筌 0011111110000011100011110011111111100000101001110011111100111111100010011011100100111111001111111110001010100011 3f838f3fe0a73f3f89b93f3fe2a3
EUC-JP ?ワ?爰??音??筌 0011111110100101111011110011111111100000101010010011111100111111101100101011101100111111001111111110010010100101 3fa5ef3fe0a93f3fb2bb3f3fe4a5
UTF-8 曆ワ퐣爰뤸략音깅툢筌 111011111010011010001011111000111000001110101111111011011001000010100011111001111000100010110000111010111010010010111000111010111001111010110101111010011001111110110011111010101011100110000101111011011000100010100010111001111010110110001100 efa68be383afed90a3e788b0eba4b8eb9eb5e99fb3eab985ed88a2e7ad8c
UHC 曆ワ퐣爰뤸략音깅툢筌 1110011010110111101010111110111110111101100011001110101010111010100011111110011010110111101010111110101111100101101100011110101110111000100110011110111110100111 e6b7abefbd8ceaba8fe6b7abebe5b1ebb899efa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)