To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN ??????妊?│}??????妊?│{^ 00111111001111110011111100111111001111110011111110010100010001000011111110000100101000000111110100111111001111110011111100111111001111110011111110010100010001000011111110000100101000000111101101011110 3f3f3f3f3f3f94443f84a07d3f3f3f3f3f3f94443f84a07b5e
EUC-JP ??????妊?│}??????妊?│{^ 00111111001111110011111100111111001111110011111111000111101001010011111110101000101000100111110100111111001111110011111100111111001111110011111111000111101001010011111110101000101000100111101101011110 3f3f3f3f3f3fc7a53fa8a27d3f3f3f3f3f3fc7a53fa8a27b5e
UTF-8 溜노죨琉블뀞妊듭│}溜노죨琉블뀞妊듭│{^ 111011111010011110001011111010111000010110111000111011001010001110101000111011111010011110001100111010111011100010010100111010111000000010011110111001011010011010001010111010111001001110101101111000101001010010000010011111011110111110100111100010111110101110000101101110001110110010100011101010001110111110100111100011001110101110111000100101001110101110000000100111101110010110100110100010101110101110010011101011011110001010010100100000100111101101011110 efa78beb85b8eca3a8efa78cebb894eb809ee5a68aeb93ade294827defa78beb85b8eca3a8efa78cebb894eb809ee5a68aeb93ade294827b5e
UHC 溜노죨琉블뀞妊듭│}溜노죨琉블뀞妊듭│{^ 111010101111111010110011111010111010000110000011111010111010010010111010111011011000010110010101111011001111010010110101111011001010011010100010011111011110101011111110101100111110101110100001100000111110101110100100101110101110110110000101100101011110110011110100101101011110110010100110101000100111101101011110 eafeb3eba183eba4baed8595ecf4b5eca6a27deafeb3eba183eba4baed8595ecf4b5eca6a27b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)