To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN 莉冶カウ閼ア邂ェ}v莉冶カウ閼ア邂ェ}vB 1110010010111011100101101110100010110110101100111110100010000100101100011110011110101110101010100111110101110110111001001011101110010110111010001011011010110011111010001000010010110001111001111010111010101010011111010111011001000010 e4bb96e8b6b3e884b1e7aeaa7d76e4bb96e8b6b3e884b1e7aeaa7d7642
EUC-JP 莉冶カウ閼ア邂ェ}v莉冶カウ閼ア邂ェ}vB 11101000101111011100110011101010100011101011011010001110101100111110111111100100100011101011000111101110101100001000111010101010011111010111011011101000101111011100110011101010100011101011011010001110101100111110111111100100100011101011000111101110101100001000111010101010011111010111011001000010 e8bdccea8eb68eb3efe48eb1eeb08eaa7d76e8bdccea8eb68eb3efe48eb1eeb08eaa7d7642
UTF-8 莉冶カウ閼ア邂ェ}v莉冶カウ閼ア邂ェ}vB 1110100010001110100010011110010110000110101101101110111110111101101101101110111110111101101100111110100110010110101111001110111110111101101100011110100110000010100000101110111110111101101010100111110101110110111010001000111010001001111001011000011010110110111011111011110110110110111011111011110110110011111010011001011010111100111011111011110110110001111010011000001010000010111011111011110110101010011111010111011001000010 e88e89e586b6efbdb6efbdb3e996bcefbdb1e98282efbdaa7d76e88e89e586b6efbdb6efbdb3e996bcefbdb1e98282efbdaa7d7642
UHC 莉冶??閼?邂?}v莉冶??閼?邂?}vB 1101011111101001111001011010011100111111001111111110010011011001001111111111101010110011001111110111110101110110110101111110100111100101101001110011111100111111111001001101100100111111111110101011001100111111011111010111011001000010 d7e9e5a73f3fe4d93ffab33f7d76d7e9e5a73f3fe4d93ffab33f7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)