To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????R?????^[?????R?????^[^ 001111110011111100111111001111110011111101010010001111110011111100111111001111110011111101011110010110110011111100111111001111110011111100111111010100100011111100111111001111110011111100111111010111100101101101011110 3f3f3f3f3f523f3f3f3f3f5e5b3f3f3f3f3f523f3f3f3f3f5e5b5e
SJIS-WIN 烏????R烏????^[烏????R烏????^[^ 10001001010001110011111100111111001111110011111101010010100010010100011100111111001111110011111100111111010111100101101110001001010001110011111100111111001111110011111101010010100010010100011100111111001111110011111100111111010111100101101101011110 89473f3f3f3f5289473f3f3f3f5e5b89473f3f3f3f5289473f3f3f3f5e5b5e
EUC-JP 烏????R烏????^[烏????R烏????^[^ 10110001101010000011111100111111001111110011111101010010101100011010100000111111001111110011111100111111010111100101101110110001101010000011111100111111001111110011111101010010101100011010100000111111001111110011111100111111010111100101101101011110 b1a83f3f3f3f52b1a83f3f3f3f5e5bb1a83f3f3f3f52b1a83f3f3f3f5e5b5e
UTF-8 烏숇죲溜쐍R烏숇죲溜쐍^[烏숇죲溜쐍R烏숇죲溜쐍^[^ 11100111100000111000111111101100100010001000011111101100101000111011001011101111101001111000101111101100100100001000110101010010111001111000001110001111111011001000100010000111111011001010001110110010111011111010011110001011111011001001000010001101010111100101101111100111100000111000111111101100100010001000011111101100101000111011001011101111101001111000101111101100100100001000110101010010111001111000001110001111111011001000100010000111111011001010001110110010111011111010011110001011111011001001000010001101010111100101101101011110 e7838fec8887eca3b2efa78bec908d52e7838fec8887eca3b2efa78bec908d5e5be7838fec8887eca3b2efa78bec908d52e7838fec8887eca3b2efa78bec908d5e5b5e
UHC 烏숇죲溜쐍R烏숇죲溜쐍^[烏숇죲溜쐍R烏숇죲溜쐍^[^ 1110100010100001100110011110101110100001100011011110101011111110100111000110111001010010111010001010000110011001111010111010000110001101111010101111111010011100011011100101111001011011111010001010000110011001111010111010000110001101111010101111111010011100011011100101001011101000101000011001100111101011101000011000110111101010111111101001110001101110010111100101101101011110 e8a199eba18deafe9c6e52e8a199eba18deafe9c6e5e5be8a199eba18deafe9c6e52e8a199eba18deafe9c6e5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)