What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????????GB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100011101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4742
SJIS-WIN 恁?恁??恁??恁?恁??恁??恁?恁?恁??GB 10011100100011000011111110011100100011000011111100111111100111001000110000111111001111111001110010001100001111111001110010001100001111110011111110011100100011000011111100111111100111001000110000111111100111001000110000111111100111001000110000111111001111110100011101000010 9c8c3f9c8c3f3f9c8c3f3f9c8c3f9c8c3f3f9c8c3f3f9c8c3f9c8c3f9c8c3f3f4742
EUC-JP 恁?恁??恁??恁?恁??恁??恁?恁?恁??GB 11010111111011000011111111010111111011000011111100111111110101111110110000111111001111111101011111101100001111111101011111101100001111110011111111010111111011000011111100111111110101111110110000111111110101111110110000111111110101111110110000111111001111110100011101000010 d7ec3fd7ec3f3fd7ec3f3fd7ec3fd7ec3f3fd7ec3f3fd7ec3fd7ec3fd7ec3f3f4742
UTF-8 恁찫恁⑹쮼恁⑹㎟恁찳恁⑹찈恁⑹쭠恁찳恁챍恁⑹쭥GB 1110011010000001100000011110110010110000101010111110011010000001100000011110001010010001101110011110110010101110101111001110011010000001100000011110001010010001101110011110001110001110100111111110011010000001100000011110110010110000101100111110011010000001100000011110001010010001101110011110110010110000100010001110011010000001100000011110001010010001101110011110110010101101101000001110011010000001100000011110110010110000101100111110011010000001100000011110110010110001100011011110011010000001100000011110001010010001101110011110110010101101101001010100011101000010 e68181ecb0abe68181e291b9ecaebce68181e291b9e38e9fe68181ecb0b3e68181e291b9ecb088e68181e291b9ecada0e68181ecb0b3e68181ecb18de68181e291b9ecada54742
UHC 恁찫恁⑹쮼恁⑹㎟恁찳恁⑹찈恁⑹쭠恁찳恁챍恁⑹쭥GB 111011001111011010101010010001001110110011110110101010011110110010101000100110001110110011110110101010011110110010100111101100011110110011110110101010100100100111101100111101101010100111101100101010011000110011101100111101101010100111101100101001111001010111101100111101101010101001001001111011001111011010101010010110011110110011110110101010011110110010100111100110010100011101000010 ecf6aa44ecf6a9eca898ecf6a9eca7b1ecf6aa49ecf6a9eca98cecf6a9eca795ecf6aa49ecf6aa59ecf6a9eca7994742

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
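
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed equivalents, and the helper names `encode_row` and `encode_table` are made up for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is the behavior visible in the ISO-8859-1 row above.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for each unencodable character.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("GB").items():
        print(label, bits, hexstr)
```

ASCII characters such as "GB" produce the same bytes (4742) in all five charsets, which is why every row of the table ends with the same sixteen bits. Only the non-ASCII characters differ from one charset to the next.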