What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 茹??魏?ぜ邑??語⑤?壹??矜裕??幽?? 1110010010100101001111110011111111101001101100000011111110000010101110101001011101010111001111110011111110001100111010101000011101000100001111111001101011100011001111110011111111100001111000001001011101010100001111110011111110010111010010000011111100111111 e4a53f3fe9b03f82ba97573f3f8cea87443f9ae33f3fe1e097543f3f97483f3f
EUC-JP 茹??魏?ぜ邑??語??壹??矜裕??幽?? 11101000101001110011111100111111111100101011001000111111101001001011110011001101101110000011111100111111101110001110110000111111001111111101010011100101001111110011111111100010111000101100110110110101001111110011111111001101101010010011111100111111 e8a73f3ff2b23fa4bccdb83f3fb8ec3f3fd4e53f3fe2e2cdb53f3fcda93f3f
UTF-8 茹띾봾魏꾥ぜ邑⑸꽠語⑤챶壹얏뀆矜裕귚펺幽껊겱 111010001000110010111001111010111001110110111110111010111011010010111110111010011010110110001111111010101011111010100101111000111000000110011100111010011000001010010001111000101001000110111000111010101011110110100000111010001010101010011110111000101001000110100100111011001011000110110110111001011010001110111001111011001001011010001111111010111000000010000110111001111001111110011100111010001010001110010101111010101011011110011010111011011000111010111010111001011011100110111101111010101011101110001010111010101011001010110001 e88cb9eb9dbeebb4bee9ad8feabea5e3819ce98291e291b8eabda0e8aa9ee291a4ecb1b6e5a3b9ec968feb8086e79f9ce8a395eab79aed8ebae5b9bdeabb8aeab2b1
UHC 茹띾봾魏꾥ぜ邑⑸꽠語⑤챶壹얏뀆矜裕귚펺幽껊겱 1110011010101010100011011110101110010100100001011110101011100000100001001110100010101010101111001110101111101001101010011110101110000100101011011110010111011110101010001110101110101010100000111110110011101100101111101110011010000101100000101101000011101000111010111010111010000010111001001011110010001010111010101110101110000011111010111000000110111101 e6aa8deb9485eae084e8aabcebe9a9eb84ade5dea8ebaa83ececbee68582d0e8ebae82e4bc8aeaeb83eb81bd

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
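
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The names `CHARSETS` and `encode_table` are hypothetical, and the mapping of the table's labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears consistent with the `3f` bytes in the table above.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
# Assumption: SJIS-WIN corresponds to cp932 and UHC to cp949.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text, charsets=CHARSETS):
    """Return a list of (label, binary string, hex string) rows for text."""
    rows = []
    for label, codec in charsets.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("あ"):
        print(label, bits, hexstr)
```

Calling `encode_table` with a short string returns one row per charset, so the rows can be printed or compared against the table produced by the page.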