To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 野??????肉η?違??艾?????媛? 100101101110110000111111001111110011111100111111001111110011111110010011111101111000001111000101001111111000100011100001001111110011111111100100100010000011111100111111001111110011111100111111100101010101000100111111 96ec3f3f3f3f3f3f93f783c53f88e13f3fe4883f3f3f3f3f95513f
EUC-JP 野???孼??肉η?違??艾?????媛? 1100110011101110001111110011111100111111100011111011101011000011001111110011111111000110111110011010011011000111001111111011000011100011001111110011111111100111111010000011111100111111001111110011111100111111110010011011001000111111 ccee3f3f3f8fbac33f3fc6f9a6c73fb0e33f3fe7e83f3f3f3f3fc9b23f
UTF-8 野ㅞ삠돘孼뽰쥋肉η솒違곷룈艾싳궠梨욘끽媛귻 1110100110000111100011101110001110000101100111101110110010000010101000001110101110001111100110001110010110101101101111001110101110111101101100001110110010100101100010111110100010000010100010011100111010110111111011001000011010010010111010011000000110010101111010101011001110110111111010111010001110001000111010001000100110111110111011001000101110110011111010101011011010100000111011111010011110100010111011001001101010011000111010111000000110111101111001011010101010011011111010101011011110111011 e9878ee3859eec82a0eb8f98e5adbcebbdb0eca58be88289ceb7ec8692e98195eab3b7eba388e889beec8bb3eab6a0efa7a2ec9a98eb81bde5aa9beab7bb
UHC 野ㅞ삠돘孼뽰쥋肉η솒違곷룈艾싳궠梨욘끽媛귻 111001011010111110100100110011101011101111100011100010011010000111100101111011011001011011101100101000101000010011101011101111111010010111100111100110011001001011101010110111101000000111101011100011111000011111100100111101011001101011101100100000101011001111101100101100011011111111100110101100111010001111101010101100001000001101000010 e5afa4cebbe389a1e5ed96eca284ebbfa5e79992eade81eb8f87e4f59aec82b3ecb1bfe6b3a3eab08342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)