What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert".


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 嚴ш?二??碎??n}嚴ш?二??碎??n{^ 10011010100011101000010010001010001111111001001111110001001111110011111111100001111010100011111100111111011011100111110110011010100011101000010010001010001111111001001111110001001111110011111111100001111010100011111100111111011011100111101101011110 9a8e848a3f93f13f3fe1ea3f3f6e7d9a8e848a3f93f13f3fe1ea3f3f6e7b5e
EUC-JP 嚴ш?二??碎??n}嚴ш?二??碎??n{^ 11010011111011101010011111101010001111111100011011110011001111110011111111100010111011000011111100111111011011100111110111010011111011101010011111101010001111111100011011110011001111110011111111100010111011000011111100111111011011100111101101011110 d3eea7ea3fc6f33f3fe2ec3f3f6e7dd3eea7ea3fc6f33f3fe2ec3f3f6e7b5e
UTF-8 嚴ш쑬二삼쭓碎몄죴n}嚴ш쑬二삼쭓碎몄죴n{^ 111001011001101010110100110100011000100011101100100100011010110011100100101110101000110011101100100000101011110011101100101011011001001111100111101000101000111011101011101010101000010011101100101000111011010001101110011111011110010110011010101101001101000110001000111011001001000110101100111001001011101010001100111011001000001010111100111011001010110110010011111001111010001010001110111010111010101010000100111011001010001110110100011011100111101101011110 e59ab4d188ec91ace4ba8cec82bcecad93e7a28eebaa84eca3b46e7de59ab4d188ec91ace4ba8cec82bcecad93e7a28eebaa84eca3b46e7b5e
UHC 嚴ш쑬二삼쭓碎몄죴n}嚴ш쑬二삼쭓碎몄죴n{^ 1110010111110001101011001110101010111110101010001110110010100011101110111110111110100111100010111110000111101111101110001110110010100001100011110110111001111101111001011111000110101100111010101011111010101000111011001010001110111011111011111010011110001011111000011110111110111000111011001010000110001111011011100111101101011110 e5f1aceabea8eca3bbefa78be1efb8eca18f6e7de5f1aceabea8eca3bbefa78be1efb8eca18f6e7b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
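
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch and not the code behind this page. The function name `encode_table` and the mapping from the table's charset labels to Python codec names are assumptions: `cp932` stands in for SJIS-WIN and `cp949` for UHC. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` bytes in the table.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = CP949
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("二n}"):
        print(label, bits, hexstr)
```

Plain ASCII characters such as `n` and `}` yield the same bytes (`6e`, `7d`) in all five charsets, while a character such as 二 differs per charset and falls back to `3f` in ISO-8859-1.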