What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 悟??侑わ?怨??癲??椅??怨??暗 100011001110010100111111001111111001100011010000100000101110110100111111100010011000010100111111001111111110000110011111001111110011111110001000110101100011111100111111100010011000010100111111001111111000100011000011 8ce53f3f98d082ed3f89853f3fe19f3f3f88d63f3f89853f3f88c3
EUC-JP 悟??侑わ?怨??癲??椅??怨??暗 101110001110011100111111001111111101000011010010101001001110111100111111101100011110010100111111001111111110001010100001001111110011111110110000110110000011111100111111101100011110010100111111001111111011000011000101 b8e73f3fd0d2a4ef3fb1e53f3fe2a13f3fb0d83f3fb1e53f3fb0c5
UTF-8 悟귣슗侑わ쭓怨뺤젘癲얇끉椅쇽쭓怨뺤젘暗 111001101000001010011111111010101011011110100011111011001000101010010111111001001011111010010001111000111000001010001111111011001010110110010011111001101000000010101000111010111011101010100100111011001010000010011000111001111001100110110010111011001001011010000111111010111000000110001001111001101010010010000101111011001000011110111101111011001010110110010011111001101000000010101000111010111011101010100100111011001010000010011000111001101001101010010111 e6829feab7a3ec8a97e4be91e3828fecad93e680a8ebbaa4eca098e799b2ec9687eb8189e6a485ec87bdecad93e680a8ebbaa4eca098e69a97
UHC 悟귣슗侑わ쭓怨뺤젘癲얇끉椅쇽쭓怨뺤젘暗 1110011111110110100000101110101110011010101001101110101011100010101010101110111110100111100010111110101010110011100101011110110010100000100101001110111110100110101111101110001110000101101111001110101111110101101111001110111110100111100010111110101010110011100101011110110010100000100101001110010011011110 e7f682eb9aa6eae2aaefa78beab395eca094efa6bee385bcebf5bcefa78beab395eca094e4de

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
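
The same kind of table can be reproduced offline. The following is a minimal sketch in Python, not the code behind this page. The function name `encode_report` and the label-to-codec mapping are assumptions made for illustration: Python's `cp932` codec stands in for SJIS-WIN and `cp949` for UHC, and characters a charset cannot represent are replaced with "?" (0x3f), which is what the 3f bytes in the table suggest.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 as implemented by Python
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 as implemented by Python
}


def encode_report(text):
    """Return a list of (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexa in encode_report("悟"):
        print(label, bits, hexa)
```

For the single character 悟 the UTF-8 row comes out as e6829f, which matches the first three bytes of the UTF-8 row in the table above, and the ISO-8859-1 row comes out as 3f because that charset has no code for the character.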