To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????N}??????????N{^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111010011100111110100111111001111110011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 鞨簿立妤塗鞨簿立妤吐N}鞨簿立妤塗鞨簿立妤吐N{^ 111010001110000010010101111010111001011110100111111110101010010010010011011010001110100011100000100101011110101110010111101001111111101010100100100100110110011001001110011111011110100011100000100101011110101110010111101001111111101010100100100100110110100011101000111000001001010111101011100101111010011111111010101001001001001101100110010011100111101101011110 e8e095eb97a7faa49368e8e095eb97a7faa493664e7de8e095eb97a7faa49368e8e095eb97a7faa493664e7b5e
EUC-JP 鞨簿立妤塗鞨簿立妤吐N}鞨簿立妤塗鞨簿立妤吐N{^ 11110000111000101100101011101101110011101010100110001111101110011010111111000101110010011111000011100010110010101110110111001110101010011000111110111001101011111100010111000111010011100111110111110000111000101100101011101101110011101010100110001111101110011010111111000101110010011111000011100010110010101110110111001110101010011000111110111001101011111100010111000111010011100111101101011110 f0e2caedcea98fb9afc5c9f0e2caedcea98fb9afc5c74e7df0e2caedcea98fb9afc5c9f0e2caedcea98fb9afc5c74e7b5e
UTF-8 鞨簿立妤塗鞨簿立妤吐N}鞨簿立妤塗鞨簿立妤吐N{^ 1110100110011110101010001110011110110000101111111110011110101011100010111110010110100110101001001110010110100001100101111110100110011110101010001110011110110000101111111110011110101011100010111110010110100110101001001110010110010000100100000100111001111101111010011001111010101000111001111011000010111111111001111010101110001011111001011010011010100100111001011010000110010111111010011001111010101000111001111011000010111111111001111010101110001011111001011010011010100100111001011001000010010000010011100111101101011110 e99ea8e7b0bfe7ab8be5a6a4e5a197e99ea8e7b0bfe7ab8be5a6a4e590904e7de99ea8e7b0bfe7ab8be5a6a4e5a197e99ea8e7b0bfe7ab8be5a6a4e590904e7b5e
UHC 鞨簿立?塗鞨簿立?吐N}鞨簿立?塗鞨簿立?吐N{^ 1100101011101010110111011010110111011000101000010011111111010011111100111100101011101010110111011010110111011000101000010011111111110111110011100100111001111101110010101110101011011101101011011101100010100001001111111101001111110011110010101110101011011101101011011101100010100001001111111111011111001110010011100111101101011110 caeaddadd8a13fd3f3caeaddadd8a13ff7ce4e7dcaeaddadd8a13fd3f3caeaddadd8a13ff7ce4e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)