What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 阡オ貂ャ諱ッ譌ヲn}阡オ貂ャ諱ッ譌ヲn{^ 1110100010010100101101011110011010111000101011001110011010000001101011111110011010010111101001100110111001111101111010001001010010110101111001101011100010101100111001101000000110101111111001101001011110100110011011100111101101011110 e894b5e6b8ace681afe697a66e7de894b5e6b8ace681afe697a66e7b5e
EUC-JP 阡オ貂ャ諱ッ譌ヲn}阡オ貂ャ諱ッ譌ヲn{^ 11101111111101001000111010110101111011001011101010001110101011001110101111100001100011101010111111101011111101111000111010100110011011100111110111101111111101001000111010110101111011001011101010001110101011001110101111100001100011101010111111101011111101111000111010100110011011100111101101011110 eff48eb5ecba8eacebe18eafebf78ea66e7deff48eb5ecba8eacebe18eafebf78ea66e7b5e
UTF-8 阡オ貂ャ諱ッ譌ヲn}阡オ貂ャ諱ッ譌ヲn{^ 1110100110011000101000011110111110111101101101011110100010110010100000101110111110111101101011001110100010101011101100011110111110111101101011111110100010101101100011001110111110111101101001100110111001111101111010011001100010100001111011111011110110110101111010001011001010000010111011111011110110101100111010001010101110110001111011111011110110101111111010001010110110001100111011111011110110100110011011100111101101011110 e998a1efbdb5e8b282efbdace8abb1efbdafe8ad8cefbda66e7de998a1efbdb5e8b282efbdace8abb1efbdafe8ad8cefbda66e7b5e
UHC 阡?貂?諱???n}阡?貂?諱???n{^ 111101001100011000111111111101011011000000111111111111011100100100111111001111110011111101101110011111011111010011000110001111111111010110110000001111111111110111001001001111110011111100111111011011100111101101011110 f4c63ff5b03ffdc93f3f3f6e7df4c63ff5b03ffdc93f3f3f6e7b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
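
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The names `CHARSETS`, `to_bit_strings`, and `convert` are hypothetical, and the mapping from the table's labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which mirrors the `3f` bytes in the ISO-8859-1 and UHC rows, though the tool's exact substitution rules may differ.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes b"?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.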