What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????nf????n^}Y????nf????n^}bE 0011111100111111001111110011111101101110011001100011111100111111001111110011111101101110010111100111110101011001001111110011111100111111001111110110111001100110001111110011111100111111001111110110111001011110011111010110001001000101 3f3f3f3f6e663f3f3f3f6e5e7d593f3f3f3f6e663f3f3f3f6e5e7d6245
SJIS-WIN 惟??阜nf惟??阜n^}Y惟??阜nf惟??阜n^}bE 10001000110100100011111100111111100101011000110001101110011001101000100011010010001111110011111110010101100011000110111001011110011111010101100110001000110100100011111100111111100101011000110001101110011001101000100011010010001111110011111110010101100011000110111001011110011111010110001001000101 88d23f3f958c6e6688d23f3f958c6e5e7d5988d23f3f958c6e6688d23f3f958c6e5e7d6245
EUC-JP 惟??阜nf惟??阜n^}Y惟??阜nf惟??阜n^}bE 10110000110101000011111100111111110010011110110001101110011001101011000011010100001111110011111111001001111011000110111001011110011111010101100110110000110101000011111100111111110010011110110001101110011001101011000011010100001111110011111111001001111011000110111001011110011111010110001001000101 b0d43f3fc9ec6e66b0d43f3fc9ec6e5e7d59b0d43f3fc9ec6e66b0d43f3fc9ec6e5e7d6245
UTF-8 惟몇렒阜nf惟몇렒阜n^}Y惟몇렒阜nf惟몇렒阜n^}bE 11100110100000111001111111101011101010101000011111101011101000001001001011101001100110001001110001101110011001101110011010000011100111111110101110101010100001111110101110100000100100101110100110011000100111000110111001011110011111010101100111100110100000111001111111101011101010101000011111101011101000001001001011101001100110001001110001101110011001101110011010000011100111111110101110101010100001111110101110100000100100101110100110011000100111000110111001011110011111010110001001000101 e6839febaa87eba092e9989c6e66e6839febaa87eba092e9989c6e5e7d59e6839febaa87eba092e9989c6e66e6839febaa87eba092e9989c6e5e7d6245
UHC 惟몇렒阜nf惟몇렒阜n^}Y惟몇렒阜nf惟몇렒阜n^}bE 111010101110111010111000111011101000111010100111110111011011110101101110011001101110101011101110101110001110111010001110101001111101110110111101011011100101111001111101010110011110101011101110101110001110111010001110101001111101110110111101011011100110011011101010111011101011100011101110100011101010011111011101101111010110111001011110011111010110001001000101 eaeeb8ee8ea7ddbd6e66eaeeb8ee8ea7ddbd6e5e7d59eaeeb8ee8ea7ddbd6e66eaeeb8ee8ea7ddbd6e5e7d6245

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
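
A table like the one above can be approximated with a few lines of Python. This is only a minimal sketch, not the converter's actual implementation: the mapping from the charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) and the helper names encode_row and encode_table are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which appears to match the 3f bytes in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# Assumption: SJIS-WIN corresponds to cp932 and UHC to cp949.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("惟nf").items():
        print(label, bits, hex_string)
```

Each byte is formatted as eight binary digits so that the binary column is always exactly four times as long as the hexadecimal column, as in the table.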