To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????s 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101110011 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f73
SJIS-WIN ???鶯?????釗??????????????s 001111110011111100111111111010011111001000111111001111110011111100111111001111111111101110111011001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101110011 3f3f3fe9f23f3f3f3f3ffbbb3f3f3f3f3f3f3f3f3f3f3f3f3f3f73
EUC-JP ???鶯?????釗??????????????s 00111111001111110011111111110010111101000011111100111111001111110011111100111111100011111110001110100110001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101110011 3f3f3ff2f43f3f3f3f3f8fe3a63f3f3f3f3f3f3f3f3f3f3f3f3f3f73
UTF-8 溜삘뵗鶯쇺뵗溜딅졎釗숉뙇栒붾젘溜삘뵗溜㏓졎栒붾젧s 11101111101001111000101111101100100000101001100011101011101101011001011111101001101101101010111111101100100001111011101011101011101101011001011111101111101001111000101111101011100101001000010111101100101000011000111011101001100001111001011111101100100010001000100111101011100110011000011111100110101000001001001011101011101101101011111011101100101000001001100011101111101001111000101111101100100000101001100011101011101101011001011111101111101001111000101111100011100011111001001111101100101000011000111011100110101000001001001011101011101101101011111011101100101000001010011101110011 efa78bec8298ebb597e9b6afec87baebb597efa78beb9485eca18ee98797ec8889eb9987e6a092ebb6beeca098efa78bec8298ebb597efa78be38f93eca18ee6a092ebb6beeca0a773
UHC 溜삘뵗鶯쇺뵗溜딅졎釗숉뙇栒붾젘溜삘뵗溜㏓졎栒붾젧s 11101010111111101011101111100010100101001001100111100101101000111001100111100010100101001001100111101010111111101000101011101011101000001011101111100001111100101001100111101101100011001000110111100010111000111001010011101011101000001001010011101010111111101011101111100010100101001001100111101010111111101010011111101011101000001011101111100010111000111001010011101011101000001001111101110011 eafebbe29499e5a399e29499eafe8aeba0bbe1f299ed8c8de2e394eba094eafebbe29499eafea7eba0bbe2e394eba09f73

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)