To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????o[??????????o[^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111011011110101101100111111001111110011111100111111001111110011111100111111001111110011111100111111011011110101101101011110 3f3f3f3f3f3f3f3f3f3f6f5b3f3f3f3f3f3f3f3f3f3f6f5b5e
SJIS-WIN 鏑?郵?臍??僥??o[鏑?郵?臍??僥??o[^ 100100110100110000111111100101110101100000111111111001000110000000111111001111111001100101000110001111110011111101101111010110111001001101001100001111111001011101011000001111111110010001100000001111110011111110011001010001100011111100111111011011110101101101011110 934c3f97583fe4603f3f99463f3f6f5b934c3f97583fe4603f3f99463f3f6f5b5e
EUC-JP 鏑?郵?臍??僥??o[鏑?郵?臍??僥??o[^ 110001011010110100111111110011011011100100111111111001111100000100111111001111111101000110100111001111110011111101101111010110111100010110101101001111111100110110111001001111111110011111000001001111110011111111010001101001110011111100111111011011110101101101011110 c5ad3fcdb93fe7c13f3fd1a73f3f6f5bc5ad3fcdb93fe7c13f3fd1a73f3f6f5b5e
UTF-8 鏑렊郵렮臍잴쓺僥당밞o[鏑렊郵렮臍잴쓺僥당밞o[^ 1110100110001111100100011110101110100000100010101110100110000011101101011110101110100000101011101110100010000111100011011110110010011110101101001110110010010011101110101110010110000011101001011110101110001011101110011110101110110000100111100110111101011011111010011000111110010001111010111010000010001010111010011000001110110101111010111010000010101110111010001000011110001101111011001001111010110100111011001001001110111010111001011000001110100101111010111000101110111001111010111011000010011110011011110101101101011110 e98f91eba08ae983b5eba0aee8878dec9eb4ec93bae583a5eb8bb9ebb09e6f5be98f91eba08ae983b5eba0aee8878dec9eb4ec93bae583a5eb8bb9ebb09e6f5b5e
UHC 鏑렊郵렮臍잴쓺僥당밞o[鏑렊郵렮臍잴쓺僥당밞o[^ 111011101110101110001110101000011110100111101000100011101011101111110000101100001100000011101010101111101011011011101000111010011011010011100111101110011110000101101111010110111110111011101011100011101010000111101001111010001000111010111011111100001011000011000000111010101011111010110110111010001110100110110100111001111011100111100001011011110101101101011110 eeeb8ea1e9e88ebbf0b0c0eabeb6e8e9b4e7b9e16f5beeeb8ea1e9e88ebbf0b0c0eabeb6e8e9b4e7b9e16f5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)