Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??Tkf??Tk^}Y??Tkf??Tk^}bE 00111111001111110101010001101011011001100011111100111111010101000110101101011110011111010101100100111111001111110101010001101011011001100011111100111111010101000110101101011110011111010110001001000101 3f3f546b663f3f546b5e7d593f3f546b663f3f546b5e7d6245
SJIS-WIN 陌始Tkf陌始Tk^}Y陌始Tkf陌始Tk^}bE 111010001001100110001110011011100101010001101011011001101110100010011001100011100110111001010100011010110101111001111101010110011110100010011001100011100110111001010100011010110110011011101000100110011000111001101110010101000110101101011110011111010110001001000101 e8998e6e546b66e8998e6e546b5e7d59e8998e6e546b66e8998e6e546b5e7d6245
EUC-JP 陌始Tkf陌始Tk^}Y陌始Tkf陌始Tk^}bE 111011111111100110111011110011110101010001101011011001101110111111111001101110111100111101010100011010110101111001111101010110011110111111111001101110111100111101010100011010110110011011101111111110011011101111001111010101000110101101011110011111010110001001000101 eff9bbcf546b66eff9bbcf546b5e7d59eff9bbcf546b66eff9bbcf546b5e7d6245
UTF-8 陌始Tkf陌始Tk^}Y陌始Tkf陌始Tk^}bE 1110100110011001100011001110010110100111100010110101010001101011011001101110100110011001100011001110010110100111100010110101010001101011010111100111110101011001111010011001100110001100111001011010011110001011010101000110101101100110111010011001100110001100111001011010011110001011010101000110101101011110011111010110001001000101 e9998ce5a78b546b66e9998ce5a78b546b5e7d59e9998ce5a78b546b66e9998ce5a78b546b5e7d6245
UHC 陌始Tkf陌始Tk^}Y陌始Tkf陌始Tk^}bE 110110001110100011100011101101110101010001101011011001101101100011101000111000111011011101010100011010110101111001111101010110011101100011101000111000111011011101010100011010110110011011011000111010001110001110110111010101000110101101011110011111010110001001000101 d8e8e3b7546b66d8e8e3b7546b5e7d59d8e8e3b7546b66d8e8e3b7546b5e7d6245

SJIS-WIN, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-WIN, i.e. CP932) and on UNIX (EUC-JP).
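
The conversion shown in the table can be reproduced with a minimal Python sketch. It is not this page's actual implementation. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption, as are the helper names `encode_row` and `encode_table`. Characters that a charset cannot represent are replaced with `?` (0x3f), which is how the ISO-8859-1 row appears to behave.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes b"?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("陌始Tk").items():
        print(label, bits, hex_string)
```

For example, `encode_row("陌始", "utf-8")` yields the hex string `e9998ce5a78b`, which matches the start of the UTF-8 row above.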