To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 弱??節わ?節←?絶??五∽?節o????^ 1000111011100011001111110011111110010000110111111000001011101101001111111001000011011111100000011010100100111111100100001110001000111111001111111000110011011100100000011110010000111111100100001101111110000010100011110011111100111111001111110011111101011110 8ee33f3f90df82ed3f90df81a93f90e23f3f8cdc81e43f90df828f3f3f3f3f5e
EUC-JP 弱??節わ?節←?絶??五∽?節o????^ 1011110011100101001111110011111111000000111000011010010011101111001111111100000011100001101000101010101100111111110000001110010000111111001111111011100011011110101000101110011000111111110000001110000110100011111011110011111100111111001111110011111101011110 bce53f3fc0e1a4ef3fc0e1a2ab3fc0e43f3fb8dea2e63fc0e1a3ef3f3f3f3f5e
UTF-8 弱녻떋節わ숲節←닇絶뚦컛五∽쉠節o숯連먲풄^ 11100101101111001011000111101011100001011011101111101011100101101000101111100111101011111000000011100011100000101000111111101100100010001011001011100111101011111000000011100010100001101001000011101011100010111000011111100111101101011011011011101011100110101010011011101100101110111001101111100100101110101001010011100010100010001011110111101100100010011010000011100111101011111000000011101111101111011000111111101100100010001010111111101111101001101001101011101011101010001011001011101101100100101000010001011110 e5bcb1eb85bbeb968be7af80e3828fec88b2e7af80e28690eb8b87e7b5b6eb9aa6ecbb9be4ba94e288bdec89a0e7af80efbd8fec88afefa69aeba8b2ed92845e
UHC 弱녻떋節わ숲節←닇絶뚦컛五∽쉠節o숯連먲풄^ 11100101101100001000011011101000100010111010000111101111101111011010101011101111101111011010001111101111101111011010000111100111100010001001000011101111101111101000110011100101101100001000011011100111111010011010000111101111101111011010101011101111101111011010001111101111101111011010000111100110111001101001000011101111101111101000110001011110 e5b086e88ba1efbdaaefbda3efbda1e78890efbe8ce5b086e7e9a1efbdaaefbda3efbda1e6e690efbe8c5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)