To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???冶⑤???????陰?????淫??^ 0011111100111111001111111001011011101000100001110100010000111111001111110011111100111111001111110011111100111111100010010100000100111111001111110011111100111111001111111000100011111010001111110011111101011110 3f3f3f96e887443f3f3f3f3f3f3f89413f3f3f3f3f88fa3f3f5e
EUC-JP ???冶????????陰?????淫??^ 00111111001111110011111111001100111010100011111100111111001111110011111100111111001111110011111100111111101100011010001000111111001111110011111100111111001111111011000011111100001111110011111101011110 3f3f3fccea3f3f3f3f3f3f3f3fb1a23f3f3f3f3fb0fc3f3f5e
UTF-8 溜깅젡冶⑤젿溜싳꺏溜깅젛陰붾젿溜싨꽋淫쒖꽘^ 11101111101001111000101111101010101110011000010111101100101000001010000111100101100001101011011011100010100100011010010011101100101000001011111111101111101001111000101111101100100010111011001111101010101110101000111111101111101001111000101111101010101110011000010111101100101000001001101111101001100110011011000011101011101101101011111011101100101000001011111111101111101001111000101111101100100010111010100011101010101111011000101111100110101101111010101111101100100100101001011011101010101111011001100001011110 efa78beab985eca0a1e586b6e291a4eca0bfefa78bec8bb3eaba8fefa78beab985eca09be999b0ebb6beeca0bfefa78bec8ba8eabd8be6b7abec9296eabd985e
UHC 溜깅젡冶⑤젿溜싳꺏溜깅젛陰붾젿溜싨꽋淫쒖꽘^ 11101010111111101011000111101011101000001001101011100101101001111010100011101011101000001011000111101010111111101001101011101100100000111011010111101010111111101011000111101011101000001001011111101011111001001001010011101011101000001011000111101010111111101001101011100110100001001001101111101011111000101001110011101100100001001010011101011110 eafeb1eba09ae5a7a8eba0b1eafe9aec83b5eafeb1eba097ebe494eba0b1eafe9ae6849bebe29cec84a75e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)