What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 窈???э?怨??午??椅э?恂?┥筌 11100010011101110011111100111111001111111000010010001111001111111000100110000101001111110011111110001100110111110011111100111111100010001101011010000100100011110011111110011100100101100011111110000100101111001110001010100011 e2773f3f3f848f3f89853f3f8cdf3f3f88d6848f3f9c963f84bce2a3
EUC-JP 窈???э?怨??午??椅э?恂?┥筌 11100011110110000011111100111111001111111010011111101111001111111011000111100101001111110011111110111000111000010011111100111111101100001101100010100111111011110011111111010111111101100011111110101000101111101110010010100101 e3d83f3f3fa7ef3fb1e53f3fb8e13f3fb0d8a7ef3fd7f63fa8bee4a5
UTF-8 窈띲끇鱗э쭓怨뺤졂午대뀪椅э쭓恂㏃┥筌 11100111101010101000100011101011100111011011001011101011100000011000011111101111101001111011001011010001100011011110110010101101100100111110011010000000101010001110101110111010101001001110110010100001100000101110010110001101100010001110101110001100100000001110101110000000101010101110011010100100100001011101000110001101111011001010110110010011111001101000000110000010111000111000111110000011111000101001010010100101111001111010110110001100 e7aa88eb9db2eb8187efa7b2d18decad93e680a8ebbaa4eca182e58d88eb8c80eb80aae6a485d18decad93e68182e38f83e294a5e7ad8c
UHC 窈띲끇鱗э쭓怨뺤졂午대뀪椅э쭓恂㏃┥筌 1110100110100001100011011110001110000101101110111110110011100111101011001110111110100111100010111110101010110011100101011110110010100000101100111110011111101101101101001110101110000101101000001110101111110101101011001110111110100111100010111110001011100001101001111110110010100110101111101110111110100111 e9a18de385bbece7acefa78beab395eca0b3e7edb4eb85a0ebf5acefa78be2e1a7eca6beefa7

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is encoded as "?" (0x3f).