To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?る?毅??怨??阿??苑??魏?????一 0011111110000010111010010011111110001011010000100011111100111111100010011000010100111111001111111000100010100010001111110011111110001001100100010011111100111111111010011011000000111111001111110011111100111111001111111000100011101010 3f82e93f8b423f3f89853f3f88a23f3f89913f3fe9b03f3f3f3f3f88ea
EUC-JP ?る?毅??怨??阿??苑??魏??孼??一 00111111101001001110101100111111101101011010001100111111001111111011000111100101001111110011111110110000101001000011111100111111101100011111000100111111001111111111001010110010001111110011111110001111101110101100001100111111001111111011000011101100 3fa4eb3fb5a33f3fb1e53f3fb0a43f3fb1f13f3ff2b23f3f8fbac33f3fb0ec
UTF-8 閭る벡毅뺠짆怨⑹젂阿숋퐣苑섌쪛魏껎돪孼대같一 111011111010011010000110111000111000001010001011111010111011001010100001111001101010111110000101111010111011101010100000111011001010011110000110111001101000000010101000111000101001000110111001111011001010000010000010111010011001100010111111111011001000100010001011111011011001000010100011111010001000101110010001111011001000010010001100111011001010101010011011111010011010110110001111111010101011101110001110111010111000111110101010111001011010110110111100111010111000110010000000111010101011000010011001111001001011100010000000 efa686e3828bebb2a1e6af85ebbaa0eca786e680a8e291b9eca082e998bfec888bed90a3e88b91ec848cecaa9be9ad8feabb8eeb8faae5adbceb8c80eab099e4b880
UHC 閭る벡毅뺠짆怨⑹젂阿숋퐣苑섌쪛魏껎돪孼대같一 1110011010101101101010101110101110111010101001001110101111110110100101011110100010100011100101011110101010110011101010011110110010100000100001101110010010111001100110011110111110111101100011001110101010111101100110001110100110100101100101001110101011100000100000111110110110001001101011011110010111101101101101001110101110110000101100001110110011101001 e6adaaebbaa4ebf695e8a395eab3a9eca086e4b999efbd8ceabd98e9a594eae083ed89ade5edb4ebb0b0ece9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)