To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 倭??踰??應???ル?宥??魏??鵝?~ 1001100001100000001111110011111111100110111110100011111100111111100111001110010000111111001111110011111110000011100010110011111110010111010001110011111100111111111010011011000000111111001111111110101001000000001111111000000101100000 98603f3fe6fa3f3f9ce43f3f3f838b3f97473f3fe9b03f3fea403f8160
EUC-JP 倭??踰??應???ル?宥??魏??鵝?〜 1100111111000001001111110011111111101100111111000011111100111111110110001110011000111111001111110011111110100101111010110011111111001101101010000011111100111111111100101011001000111111001111111111001110100001001111111010000111000001 cfc13f3fecfc3f3fd8e63f3f3fa5eb3fcda83f3ff2b23f3ff3a13fa1c1
UTF-8 倭녾낮踰싨쨫應밸쇊曆ル뵁宥욇쪛魏껎돧鵝얠~ 111001011000000010101101111010111000010110111110111010111000001010101110111010001011100010110000111011001000101110101000111011001010100010101011111001101000011110001001111010111011000010111000111011001000011110001010111011111010011010001011111000111000001110101011111010111011010110000001111001011010111010100101111011001001101010000111111011001010101010011011111010011010110110001111111010101011101110001110111010111000111110100111111010011011010110011101111011001001011010100000111011111011110110011110 e580adeb85beeb82aee8b8b0ec8ba8eca8abe68789ebb0b8ec878aefa68be383abebb581e5aea5ec9a87ecaa9be9ad8feabb8eeb8fa7e9b59dec96a0efbd9e
UHC 倭녾낮踰싨쨫應밸쇊曆ル뵁宥욇쪛魏껎돧鵝얠~ 111010001101111010000110111010101011001110110111111010111011001010011010111001101010010010000101111010111110101110111001111010111001100110111100111001101011011110101011111010111001010010000111111010101110100110011110111010011010010110010100111010101110000010000011111011011000100110101011111001001011110110111110111011001010001010100110 e8de86eab3b7ebb29ae6a485ebebb9eb99bce6b7abeb9487eae99ee9a594eae083ed89abe4bdbeeca2a6

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
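
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the table's charset labels to Python codec names (for example UHC to cp949) and the helper name encode_row are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match the 3f bytes in the table above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text encoded with codec.

    Unencodable characters become "?" (0x3f); this is an assumption about
    how the page behaves, based on the 3f bytes visible in its output.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_row("あ", codec)
        print(label, bits, hex_string)
```

Calling encode_row once per charset yields one row of the table. Because each byte is rendered as exactly eight binary digits, the binary column is always four times as long as the hexadecimal column.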