To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???靭?㎡???筌??愉??幽?????? 001111110011111100111111100100000111100000111111100001110111010100111111001111110011111111100010101000110011111100111111100101101111100100111111001111111001011101001000001111110011111100111111001111110011111100111111 3f3f3f90783f87753f3f3fe2a33f3f96f93f3f97483f3f3f3f3f3f
EUC-JP ???靭??洹??筌??愉??幽??孼??? 001111110011111100111111101111111101100100111111001111111000111111000111101110100011111100111111111001001010010100111111001111111100110011111011001111110011111111001101101010010011111100111111100011111011101011000011001111110011111100111111 3f3f3fbfd93f3f8fc7ba3f3fe4a53f3fccfb3f3fcda93f3f8fbac33f3f3f
UTF-8 麗몃쓷靭뚳㎡洹앸젘筌먯쉸愉뉒솾幽됰섣孼뽏룸룆 111011111010011010001000111010111010101010000011111011001001001110110111111010011001110110101101111010111001101010110011111000111000111010100001111001101011010010111001111011001001010110111000111011001010000010011000111001111010110110001100111010111010100010101111111011001000100110111000111001101000010010001001111010111000100110010010111011001000011010111110111001011011100110111101111010111001000010110000111011001000010010100011111001011010110110111100111010111011110110001111111010111010001110111000111010111010001110000110 efa688ebaa83ec93b7e99dadeb9ab3e38ea1e6b4b9ec95b8eca098e7ad8ceba8afec89b8e68489eb8992ec86bee5b9bdeb90b0ec84a3e5adbcebbd8feba3b8eba386
UHC 麗몃쓷靭뚳㎡洹앸젘筌먯쉸愉뉒솾幽됰섣孼뽏룸룆 1110011010110000101110001110101110011101100101001110110011100101100011001110111110100111101100111110101010110111100111011110101110100000100101001110111110100111100100001110110010011010100011101110101011110000100001111110011110011001101100101110101011101011100010011110101110111100101100101110010111101101100101101100111010110111111010111000111110000101 e6b0b8eb9d94ece58cefa7b3eab79deba094efa790ec9a8eeaf087e799b2eaeb89ebbcb2e5ed96ceb7eb8f85

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)