To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 茹???亦??萸 1110010010100101001111110011111100111111100101101001001000111111001111111110010011001110 e4a53f3f3f96923f3fe4ce
EUC-JP 茹???亦??萸 1110100010100111001111110011111100111111110010111111001000111111001111111110100011010000 e8a73f3f3fcbf23f3fe8d0
UTF-8 茹띾뜄큔亦밤뀿萸 111010001000110010111001111010111001110110111110111010111001110010000100111011011000000110010100111001001011101010100110111010111011000010100100111010111000000010111111111010001001000010111000 e88cb9eb9dbeeb9c84ed8194e4baa6ebb0a4eb80bfe890b8
UHC 茹띾뜄큔亦밤뀿萸 11100110101010101000110111101011100011011000100011000101101001101110011010110010101110011110001110000101101101011110101110101101 e6aa8deb8d88c5a6e6b2b9e385b5ebad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)