To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???????る?異??嶸????????^ 00111111001111110011111100111111001111110011111100111111100000101110100100111111100010001101100100111111001111111111101010110100001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f82e93f88d93f3ffab43f3f3f3f3f3f3f3f5e
EUC-JP ???????る?異??嶸????????^ 0011111100111111001111110011111100111111001111110011111110100100111010110011111110110000110110110011111100111111100011111011101111110100001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3fa4eb3fb0db3f3f8fbbf43f3f3f3f3f3f3f3f5e
UTF-8 溜사츧溜쒕졎溜る졎異덈졎嶸앸젣溜삳젶溜살뎠^ 11101111101001111000101111101100100000101010110011101100101110001010011111101111101001111000101111101100100100101001010111101100101000011000111011101111101001111000101111100011100000101000101111101100101000011000111011100111100101011011000011101011100011011000100011101100101000011000111011100101101101101011100011101100100101011011100011101100101000001010001111101111101001111000101111101100100000101011001111101100101000001011011011101111101001111000101111101100100000101011010011101011100011101010000001011110 efa78bec82acecb8a7efa78bec9295eca18eefa78be3828beca18ee795b0eb8d88eca18ee5b6b8ec95b8eca0a3efa78bec82b3eca0b6efa78bec82b4eb8ea05e
UHC 溜사츧溜쒕졎溜る졎異덈졎嶸앸젣溜삳젶溜살뎠^ 11101010111111101011101111100111101011101001110111101010111111101001110011101011101000001011101111101010111111101010101011101011101000001011101111101100101101101000100011101011101000001011101111100111101011101001110111101011101000001001110011101010111111101011101111101011101000001010101011101010111111101011101111101100101101011011000101011110 eafebbe7ae9deafe9ceba0bbeafeaaeba0bbecb688eba0bbe7ae9deba09ceafebbeba0aaeafebbecb5b15e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
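
The conversion shown in the table can be reproduced roughly as follows. This is a minimal sketch, not the tool's own implementation. The function name `encode_rows` is hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, `euc_jp` for EUC-JP) are assumed equivalents of the charsets listed above. Characters a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table shows.

```python
# Labels follow the table above; the Python codec names are assumed equivalents.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_rows(text):
    """Return a (label, binary string, hex string) tuple for each charset."""
    rows = []
    for label, codec in CHARSETS:
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_rows("る^"):
        print(label, bits, hexstr)
```

For the input "る^", the hexadecimal column should agree with the corresponding bytes in the table: 82e9 (SJIS-WIN), a4eb (EUC-JP), e3828b (UTF-8) and aaeb (UHC) for "る", 3f for the unencodable character in ISO-8859-1, and 5e for "^" in every charset.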