What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???乙??幽??馭?????恂ы?孃??^ 00111111001111110011111110001001101100110011111100111111100101110100100000111111001111111110100101100110001111110011111100111111001111110011111110011100100101101000010010001101001111111001101101101111001111110011111101011110 3f3f3f89b33f3f97483f3fe9663f3f3f3f3f9c96848d3f9b6f3f3f5e
EUC-JP ???乙??幽??馭??沅??恂ы?孃??^ 001111110011111100111111101100101011010100111111001111111100110110101001001111110011111111110001110001110011111100111111100011111100011011101001001111110011111111010111111101101010011111101101001111111101010111010000001111110011111101011110 3f3f3fb2b53f3fcda93f3ff1c73f3f8fc6e93f3fd7f6a7ed3fd5d03f3f5e
UTF-8 列룸씍乙ⓨ뿽幽덌폊馭곥렞沅욅뙴恂ы맚孃뉕뉵^ 111011111010011010011100111010111010001110111000111011001001010010001101111001001011100110011001111000101001001110101000111010111011111110111101111001011011100110111101111010111000110110001100111011011000111110001010111010011010011010101101111010101011001110100101111010111010000010011110111001101011001010000101111011001001101010000101111010111001100110110100111001101000000110000010110100011000101111101011101001111001101011100101101011011000001111101011100010011001010111101011100010011011010101011110 efa69ceba3b8ec948de4b999e293a8ebbfbde5b9bdeb8d8ced8f8ae9a6adeab3a5eba09ee6b285ec9a85eb99b4e68182d18beba79ae5ad83eb8995eb89b55e
UHC 列룸씍乙ⓨ뿽幽덌폊馭곥렞沅욅뙴恂ы맚孃뉕뉵^ 11100110111010101011011111101011100111011010010011101011111000001010100011100101100101111011110111101010111010111000100011101111101111001001010111100101110111111000000111100011100011101010111111101010101101101001111011100111100011001011011111100010111000011010110011101101100100001010101011100101101111101000011111101010101101001011101101011110 e6eab7eb9da4ebe0a8e597bdeaeb88efbc95e5df81e38eafeab69ee78cb7e2e1aced90aae5be87eab4bb5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f).
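
The conversion shown in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the code behind this page: the function name encode_bits and the CHARSETS mapping are hypothetical, and it assumes that Python's built-in codecs cp932 and cp949 are close enough to SJIS-Win and UHC for illustration. Characters that a charset cannot represent are replaced with "?" (3f), matching the table.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
# Assumption: cp932 stands in for SJIS-Win and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "乙^"
    for label, codec in CHARSETS.items():
        binary, hexa = encode_bits(sample, codec)
        print(label, sample, binary, hexa)
```

Using errors="replace" is a design choice made here to mirror the "?" placeholders in the table; errors="strict" would raise UnicodeEncodeError instead.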