To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 岳??孺??幽??筌?????淫??沃??? 10001010011110000011111100111111100110110111110100111111001111111001011101001000001111110011111111100010101000110011111100111111001111110011111100111111100010001111101000111111001111111001011110000000001111110011111100111111 8a783f3f9b7d3f3f97483f3fe2a33f3f3f3f3f88fa3f3f97803f3f3f
EUC-JP 岳??孺??幽??筌?????淫??沃??? 10110011110110010011111100111111110101011101111000111111001111111100110110101001001111110011111111100100101001010011111100111111001111110011111100111111101100001111110000111111001111111100110111100000001111110011111100111111 b3d93f3fd5de3f3fcda93f3fe4a53f3f3f3f3fb0fc3f3fcde03f3f3f
UTF-8 岳묒빘孺얕굜幽녿눛筌뚭쑴理껓쬊淫뗫뀆沃샩쇰뀋 111001011011001010110011111010111010110010010010111010111011100110011000111001011010110110111010111011001001011010010101111010101011010110011100111001011011100110111101111010111000010110111111111010111000100010011011111001111010110110001100111010111001101010101101111011001001000110110100111011111010011110100100111010101011101110010011111011001010110010001010111001101011011110101011111010111001011110101011111010111000000010000110111001101011001010000011111011001000001110101001111011001000011110110000111010111000000010001011 e5b2b3ebac92ebb998e5adbaec9695eab59ce5b9bdeb85bfeb889be7ad8ceb9aadec91b4efa7a4eabb93ecac8ae6b7abeb97abeb8086e6b283ec83a9ec87b0eb808b
UHC 岳묒빘孺얕굜幽녿눛筌뚭쑴理껓쬊淫뗫뀆沃샩쇰뀋 1110010010111111100100011110110010010101101110011110101011101000101111101110100010000010100001001110101011101011100001101110101110000111101100111110111110100111100011001110101010111110101010011110110010110101100000111110111110100110101000001110101111100010100010111110101110000101100000101110100010101010100110001100111010111100111010111000010110000111 e4bf91ec95b9eae8bee88284eaeb86eb87b3efa78ceabea9ecb583efa6a0ebe28beb8582e8aa98cebceb8587

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)