To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 也??飮?┸???}也??飮?┸???{^ 100101101110011100111111001111111001111101011010001111111000010010111101001111110011111100111111011111011001011011100111001111110011111110011111010110100011111110000100101111010011111100111111001111110111101101011110 96e73f3f9f5a3f84bd3f3f3f7d96e73f3f9f5a3f84bd3f3f3f7b5e
EUC-JP 也??飮?┸???}也??飮?┸???{^ 110011001110100100111111001111111101110110111011001111111010100010111111001111110011111100111111011111011100110011101001001111110011111111011101101110110011111110101000101111110011111100111111001111110111101101011110 cce93f3fddbb3fa8bf3f3f3f7dcce93f3fddbb3fa8bf3f3f3f7b5e
UTF-8 也㏓뜉飮뀐┸戮녹돇}也㏓뜉飮뀐┸戮녹돇{^ 111001001011100110011111111000111000111110010011111010111001110010001001111010011010001110101110111010111000000010010000111000101001010010111000111011111010011110010010111010111000010110111001111010111000111110000111011111011110010010111001100111111110001110001111100100111110101110011100100010011110100110100011101011101110101110000000100100001110001010010100101110001110111110100111100100101110101110000101101110011110101110001111100001110111101101011110 e4b99fe38f93eb9c89e9a3aeeb8090e294b8efa792eb85b9eb8f877de4b99fe38f93eb9c89e9a3aeeb8090e294b8efa792eb85b9eb8f877b5e
UHC 也㏓뜉飮뀐┸戮녹돇}也㏓뜉飮뀐┸戮녹돇{^ 111001011010010110100111111010111000110110001100111010111110011010110010111011111010011010111111111010111011110110110011111011001000100110011000011111011110010110100101101001111110101110001101100011001110101111100110101100101110111110100110101111111110101110111101101100111110110010001001100110000111101101011110 e5a5a7eb8d8cebe6b2efa6bfebbdb3ec89987de5a5a7eb8d8cebe6b2efa6bfebbdb3ec89987b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)