To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 二??詐?蜂導??二??詐?蜂導??^ 100100111111000100111111001111111000110110111100001111111001011001001001100100111011000100111111001111111001001111110001001111110011111110001101101111000011111110010110010010011001001110110001001111110011111101011110 93f13f3f8dbc3f964993b13f3f93f13f3f8dbc3f964993b13f3f5e
EUC-JP 二??詐?蜂導檉?二??詐?蜂導檉?^ 11000110111100110011111100111111101110101011111000111111110010111010101011000110101100111000111111000101101110110011111111000110111100110011111100111111101110101011111000111111110010111010101011000110101100111000111111000101101110110011111101011110 c6f33f3fbabe3fcbaac6b38fc5bb3fc6f33f3fbabe3fcbaac6b38fc5bb3f5e
UTF-8 二쿰렱詐렱蜂導檉렊二쿰렱詐렱蜂導檉렊^ 11100100101110101000110011101100101111111011000011101011101000001011000111101000101010011001000011101011101000001011000111101000100111001000001011100101101100001000111011100110101010101000100111101011101000001000101011100100101110101000110011101100101111111011000011101011101000001011000111101000101010011001000011101011101000001011000111101000100111001000001011100101101100001000111011100110101010101000100111101011101000001000101001011110 e4ba8cecbfb0eba0b1e8a990eba0b1e89c82e5b08ee6aa89eba08ae4ba8cecbfb0eba0b1e8a990eba0b1e89c82e5b08ee6aa89eba08a5e
UHC 二쿰렱詐렱蜂導檉렊二쿰렱詐렱蜂導檉렊^ 11101100101000111100010011110001100011101011111011011110111100011000111010111110110111001111000011010011111101001110111111100000100011101010000111101100101000111100010011110001100011101011111011011110111100011000111010111110110111001111000011010011111101001110111111100000100011101010000101011110 eca3c4f18ebedef18ebedcf0d3f4efe08ea1eca3c4f18ebedef18ebedcf0d3f4efe08ea15e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)