What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 霓??巡??宥????????????k? 11101000101111010011111100111111100011111000010000111111001111111001011101000111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111100000101000101100111111 e8bd3f3f8f843f3f97473f3f3f3f3f3f3f3f3f3f3f3f828b3f
EUC-JP 霓??巡??宥??孼?????洧???k? 1111000010111111001111110011111110111101111001000011111100111111110011011010100000111111001111111000111110111010110000110011111100111111001111110011111100111111100011111100011110110100001111110011111100111111101000111110101100111111 f0bf3f3fbde43f3fcda83f3f8fbac33f3f3f3f3f8fc7b43f3f3fa3eb3f
UTF-8 霓낅뜄巡뺞끽宥몃옜孼꾩슜溜잍갭洧좎삖力k돭 111010011001110010010011111010111000001010000101111010111001110010000100111001011011011110100001111010111011101010011110111010111000000110111101111001011010111010100101111010111010101010000011111011001001100010011100111001011010110110111100111010101011111010101001111011001000101010011100111011111010011110001011111011001001111010001101111010101011000010101101111001101011010010100111111011001010001010001110111011001000001010010110111011111010011010001010111011111011110110001011111010111000111110101101 e99c93eb8285eb9c84e5b7a1ebba9eeb81bde5aea5ebaa83ec989ce5adbceabea9ec8a9cefa78bec9e8deab0ade6b4a7eca28eec8296efa68aefbd8beb8fad
UHC 霓낅뜄巡뺞끽宥몃옜孼꾩슜溜잍갭洧좎삖力k돭 111001111110011110000101111010111000110110001000111000101101111010010101111001101011001110100011111010101110100110111000111010111011111110111111111001011110110110000100111011001001101010101001111010101111111010011111111001101011000010111000111010101111101110100000111011001001100010011010111001101011001110100011111010111000100110110000 e7e785eb8d88e2de95e6b3a3eae9b8ebbfbfe5ed84ec9aa9eafe9fe6b0b8eafba0ec989ae6b3a3eb89b0

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
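
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's codec names `cp932` and `cp949` correspond to SJIS-WIN and UHC, and that characters a charset cannot represent are replaced by `?` (byte 0x3f), which is consistent with the runs of `3f` in the table above. The names `CHARSETS` and `encode_table` are made up for this illustration.

```python
# Map the labels used in the table to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_table(text, charsets=CHARSETS):
    """Return (label, hex string, binary string) for text in each charset."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, data.hex(), bits))
    return rows


if __name__ == "__main__":
    for label, hex_str, bits in encode_table("あ"):
        print(label, bits, hex_str)
```

For the hiragana "あ", this gives `e38182` in UTF-8, `82a0` in SJIS-WIN, and `a4a2` in EUC-JP, while ISO-8859-1 falls back to `3f` because it has no code point for the character.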