To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 野????碎?????野????碎?????B 100101101110110000111111001111110011111100111111111000011110101000111111001111110011111100111111001111111001011011101100001111110011111100111111001111111110000111101010001111110011111100111111001111110011111101000010 96ec3f3f3f3fe1ea3f3f3f3f3f96ec3f3f3f3fe1ea3f3f3f3f3f42
EUC-JP 野????碎?????野????碎?????B 110011001110111000111111001111110011111100111111111000101110110000111111001111110011111100111111001111111100110011101110001111110011111100111111001111111110001011101100001111110011111100111111001111110011111101000010 ccee3f3f3f3fe2ec3f3f3f3f3fccee3f3f3f3fe2ec3f3f3f3f3f42
UTF-8 野ㅞ뼛꿩뤃碎띠뜏歷몃뿿野ㅞ뼛꿩뤃碎띠뜏歷몃뿿B 11101001100001111000111011100011100001011001111011101011101111001001101111101010101111111010100111101011101001001000001111100111101000101000111011101011100111011010000011101011100111001000111111101111101001101000110011101011101010101000001111101011101111111011111111101001100001111000111011100011100001011001111011101011101111001001101111101010101111111010100111101011101001001000001111100111101000101000111011101011100111011010000011101011100111001000111111101111101001101000110011101011101010101000001111101011101111111011111101000010 e9878ee3859eebbc9beabfa9eba483e7a28eeb9da0eb9c8fefa68cebaa83ebbfbfe9878ee3859eebbc9beabfa9eba483e7a28eeb9da0eb9c8fefa68cebaa83ebbfbf42
UHC 野ㅞ뼛꿩뤃碎띠뜏歷몃뿿野ㅞ뼛꿩뤃碎띠뜏歷몃뿿B 111001011010111110100100110011101011101111000100101100101110011010001111101101001110000111101111101101101110110010001101100100101110011010111000101110001110101110010111101111111110010110101111101001001100111010111011110001001011001011100110100011111011010011100001111011111011011011101100100011011001001011100110101110001011100011101011100101111011111101000010 e5afa4cebbc4b2e68fb4e1efb6ec8d92e6b8b8eb97bfe5afa4cebbc4b2e68fb4e1efb6ec8d92e6b8b8eb97bf42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)