To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?氓???韶傲???淫??氓???韶傲???淫?^ 001111111001111110000010001111110011111100111111111010001110111010011000111111000011111100111111001111111000100011111010001111110011111110011111100000100011111100111111001111111110100011101110100110001111110000111111001111110011111110001000111110100011111101011110 3f9f823f3f3fe8ee98fc3f3f3f88fa3f3f9f823f3f3fe8ee98fc3f3f3f88fa3f5e
EUC-JP ?氓???韶傲???淫??氓???韶傲???淫?^ 001111111101110111100010001111110011111100111111111100001111000011010000111111100011111100111111001111111011000011111100001111110011111111011101111000100011111100111111001111111111000011110000110100001111111000111111001111110011111110110000111111000011111101011110 3fdde23f3f3ff0f0d0fe3f3f3fb0fc3f3fdde23f3f3ff0f0d0fe3f3f3fb0fc3f5e
UTF-8 뤗氓등춲븟韶傲퉶엌렢淫급뤗氓등춲븟韶傲퉶엌렢淫긁^ 11101011101001001001011111100110101100001001001111101011100100111011000111101100101101101011001011101011101110001001111111101001100111111011011011100101100000101011001011101101100010011011011011101100100101111000110011101011101000001010001011100110101101111010101111101010101110001000100111101011101001001001011111100110101100001001001111101011100100111011000111101100101101101011001011101011101110001001111111101001100111111011011011100101100000101011001011101101100010011011011011101100100101111000110011101011101000001010001011100110101101111010101111101010101110001000000101011110 eba497e6b093eb93b1ecb6b2ebb89fe99fb6e582b2ed89b6ec978ceba0a2e6b7abeab889eba497e6b093eb93b1ecb6b2ebb89fe99fb6e582b2ed89b6ec978ceba0a2e6b7abeab8815e
UHC 뤗氓등춲븟韶傲퉶엌렢淫급뤗氓등춲븟韶傲퉶엌렢淫긁^ 10001111110001111101100011101100101101011110111010101101100011101011101011110000111000011101001011100111111011001011100110001110101111101111110110001110101100111110101111100010101100011101111010001111110001111101100011101100101101011110111010101101100011101011101011110000111000011101001011100111111011001011100110001110101111101111110110001110101100111110101111100010101100011101110001011110 8fc7d8ecb5eead8ebaf0e1d2e7ecb98ebefd8eb3ebe2b1de8fc7d8ecb5eead8ebaf0e1d2e7ecb98ebefd8eb3ebe2b1dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)