To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN 阡オ貂ャ蜊ウ阡オ}v阡オ貂ャ蜊ウ阡オ}vB 1110100010010100101101011110011010111000101011001110010110001101101100111110100010010100101101010111110101110110111010001001010010110101111001101011100010101100111001011000110110110011111010001001010010110101011111010111011001000010 e894b5e6b8ace58db3e894b57d76e894b5e6b8ace58db3e894b57d7642
EUC-JP 阡オ貂ャ蜊ウ阡オ}v阡オ貂ャ蜊ウ阡オ}vB 11101111111101001000111010110101111011001011101010001110101011001110100111101101100011101011001111101111111101001000111010110101011111010111011011101111111101001000111010110101111011001011101010001110101011001110100111101101100011101011001111101111111101001000111010110101011111010111011001000010 eff48eb5ecba8eace9ed8eb3eff48eb57d76eff48eb5ecba8eace9ed8eb3eff48eb57d7642
UTF-8 阡オ貂ャ蜊ウ阡オ}v阡オ貂ャ蜊ウ阡オ}vB 1110100110011000101000011110111110111101101101011110100010110010100000101110111110111101101011001110100010011100100010101110111110111101101100111110100110011000101000011110111110111101101101010111110101110110111010011001100010100001111011111011110110110101111010001011001010000010111011111011110110101100111010001001110010001010111011111011110110110011111010011001100010100001111011111011110110110101011111010111011001000010 e998a1efbdb5e8b282efbdace89c8aefbdb3e998a1efbdb57d76e998a1efbdb5e8b282efbdace89c8aefbdb3e998a1efbdb57d7642
UHC 阡?貂???阡?}v阡?貂???阡?}vB 111101001100011000111111111101011011000000111111001111110011111111110100110001100011111101111101011101101111010011000110001111111111010110110000001111110011111100111111111101001100011000111111011111010111011001000010 f4c63ff5b03f3f3ff4c63f7d76f4c63ff5b03f3f3ff4c63f7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)