To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN ?戡??畯???}v?戡??畯???}vB 00111111100111010100000100111111001111111111101101101111001111110011111100111111011111010111011000111111100111010100000100111111001111111111101101101111001111110011111100111111011111010111011001000010 3f9d413f3ffb6f3f3f3f7d763f9d413f3ffb6f3f3f3f7d7642
EUC-JP 焌戡??畯???}v焌戡??畯???}vB 10001111110010011110100011011001101000100011111100111111100011111100110110111011001111110011111100111111011111010111011010001111110010011110100011011001101000100011111100111111100011111100110110111011001111110011111100111111011111010111011001000010 8fc9e8d9a23f3f8fcdbb3f3f3f7d768fc9e8d9a23f3f8fcdbb3f3f3f7d7642
UTF-8 焌戡렰렗畯쟉렰렑}v焌戡렰렗畯쟉렰렑}vB 1110011110000100100011001110011010001000101000011110101110100000101100001110101110100000100101111110011110010101101011111110110010011111100010011110101110100000101100001110101110100000100100010111110101110110111001111000010010001100111001101000100010100001111010111010000010110000111010111010000010010111111001111001010110101111111011001001111110001001111010111010000010110000111010111010000010010001011111010111011001000010 e7848ce688a1eba0b0eba097e795afec9f89eba0b0eba0917d76e7848ce688a1eba0b0eba097e795afec9f89eba0b0eba0917d7642
UHC 焌戡렰렗畯쟉렰렑}v焌戡렰렗畯쟉렰렑}vB 11110001111000001100101011110001100011101011110110001110101011001111000111100001110000001111000110001110101111011000111010100110011111010111011011110001111000001100101011110001100011101011110110001110101011001111000111100001110000001111000110001110101111011000111010100110011111010111011001000010 f1e0caf18ebd8eacf1e1c0f18ebd8ea67d76f1e0caf18ebd8eacf1e1c0f18ebd8ea67d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)