To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鵝???????????鵝???????????^ 111010100100000000111111001111110011111100111111001111110011111100111111001111110011111100111111001111111110101001000000001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 ea403f3f3f3f3f3f3f3f3f3f3fea403f3f3f3f3f3f3f3f3f3f3f5e
EUC-JP 鵝???????????鵝???????????^ 111100111010000100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111111001110100001001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 f3a13f3f3f3f3f3f3f3f3f3f3ff3a13f3f3f3f3f3f3f3f3f3f3f5e
UTF-8 鵝롨콊溜롩젺溜㎬콊溜롦녂鵝롨콊溜롩젺溜㎬콊溜롦녂^ 11101001101101011001110111101011101000011010100011101100101111011000101011101111101001111000101111101011101000011010100111101100101000001011101011101111101001111000101111100011100011101010110011101100101111011000101011101111101001111000101111101011101000011010011011101011100001011000001011101001101101011001110111101011101000011010100011101100101111011000101011101111101001111000101111101011101000011010100111101100101000001011101011101111101001111000101111100011100011101010110011101100101111011000101011101111101001111000101111101011101000011010011011101011100001011000001001011110 e9b59deba1a8ecbd8aefa78beba1a9eca0baefa78be38eacecbd8aefa78beba1a6eb8582e9b59deba1a8ecbd8aefa78beba1a9eca0baefa78be38eacecbd8aefa78beba1a6eb85825e
UHC 鵝롨콊溜롩젺溜㎬콊溜롦녂鵝롨콊溜롩젺溜㎬콊溜롦녂^ 11100100101111011000111011101000101100011000011011101010111111101000111011101001101000001010110111101010111111101010011111101000101100011000011011101010111111101000111011100110100001101011101011100100101111011000111011101000101100011000011011101010111111101000111011101001101000001010110111101010111111101010011111101000101100011000011011101010111111101000111011100110100001101011101001011110 e4bd8ee8b186eafe8ee9a0adeafea7e8b186eafe8ee686bae4bd8ee8b186eafe8ee9a0adeafea7e8b186eafe8ee686ba5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)