To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 枳???鄭?儲褐?繃枳???鄭?儲褐?棚^ 10011110011010110011111100111111001111111001001101000001001111111001011011010111100010101000110000111111111000110111110110011110011010110011111100111111001111111001001101000001001111111001011011010111100010101000110000111111100100100100100101011110 9e6b3f3f3f93413f96d78a8c3fe37d9e6b3f3f3f93413f96d78a8c3f92495e
EUC-JP 枳?雩?鄭?儲褐?繃枳?雩?鄭?儲褐?棚^ 1101101111001100001111111000111111100110111110100011111111000101101000100011111111001100110110011011001111101100001111111110010111011110110110111100110000111111100011111110011011111010001111111100010110100010001111111100110011011001101100111110110000111111110000111010101001011110 dbcc3f8fe6fa3fc5a23fccd9b3ec3fe5dedbcc3f8fe6fa3fc5a23fccd9b3ec3fc3aa5e
UTF-8 枳렟雩렮鄭렩儲褐렖繃枳렟雩렮鄭렩儲褐렖棚^ 11100110100111101011001111101011101000001001111111101001100110111010100111101011101000001010111011101001100001001010110111101011101000001010100111100101100001001011001011101000101001001001000011101011101000001001011011100111101110011000001111100110100111101011001111101011101000001001111111101001100110111010100111101011101000001010111011101001100001001010110111101011101000001010100111100101100001001011001011101000101001001001000011101011101000001001011011100110101000111001101001011110 e69eb3eba09fe99ba9eba0aee984adeba0a9e584b2e8a490eba096e7b983e69eb3eba09fe99ba9eba0aee984adeba0a9e584b2e8a490eba096e6a39a5e
UHC 枳렟雩렮鄭렩儲褐렖繃枳렟雩렮鄭렩儲褐렖棚^ 1111001010101100100011101011000011101001111011001000111010111011111011111111011110001110101101111110111010111001110010101110100010001110101010111101110111011110111100101010110010001110101100001110100111101100100011101011101111101111111101111000111010110111111011101011100111001010111010001000111010101011110111011101110001011110 f2ac8eb0e9ec8ebbeff78eb7eeb9cae88eabdddef2ac8eb0e9ec8ebbeff78eb7eeb9cae88eabdddc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)