To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 伍??厓??亦??[伍??厓??亦??[^ 100011001101111000111111001111111111101010001101001111110011111110010110100100100011111100111111010110111000110011011110001111110011111111111010100011010011111100111111100101101001001000111111001111110101101101011110 8cde3f3ffa8d3f3f96923f3f5b8cde3f3ffa8d3f3f96923f3f5b5e
EUC-JP 伍??厓??亦??[伍??厓??亦??[^ 1011100011100000001111110011111110001111101101001100011100111111001111111100101111110010001111110011111101011011101110001110000000111111001111111000111110110100110001110011111100111111110010111111001000111111001111110101101101011110 b8e03f3f8fb4c73f3fcbf23f3f5bb8e03f3f8fb4c73f3fcbf23f3f5b5e
UTF-8 伍닸꼬厓김뙱亦욕뤀[伍닸꼬厓김뙱亦욕뤀[^ 111001001011110010001101111010111000101110111000111010101011110010101100111001011000111010010011111010101011100110000000111010111001100110110001111001001011101010100110111011001001101010010101111010111010010010000000010110111110010010111100100011011110101110001011101110001110101010111100101011001110010110001110100100111110101010111001100000001110101110011001101100011110010010111010101001101110110010011010100101011110101110100100100000000101101101011110 e4bc8deb8bb8eabcace58e93eab980eb99b1e4baa6ec9a95eba4805be4bc8deb8bb8eabcace58e93eab980eb99b1e4baa6ec9a95eba4805b5e
UHC 伍닸꼬厓김뙱亦욕뤀[伍닸꼬厓김뙱亦욕뤀[^ 111001111110101010110100111001101011001010111111111001001110110110110001111010001000110010110100111001101011001010111111111001011000111110110001010110111110011111101010101101001110011010110010101111111110010011101101101100011110100010001100101101001110011010110010101111111110010110001111101100010101101101011110 e7eab4e6b2bfe4edb1e88cb4e6b2bfe58fb15be7eab4e6b2bfe4edb1e88cb4e6b2bfe58fb15b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)