To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?????????濡??幼?????娃?【B 0011111100111111001111110011111100111111001111110011111100111111001111111001010001000111001111110011111110010111011000110011111100111111001111110011111100111111100010001010000100111111100000010111100101000010 3f3f3f3f3f3f3f3f3f94473f3f97633f3f3f3f3f88a13f817942
EUC-JP ?????????濡??幼?????娃?【B 0011111100111111001111110011111100111111001111110011111100111111001111111100011110101000001111110011111111001101110001000011111100111111001111110011111100111111101100001010001100111111101000011101101001000010 3f3f3f3f3f3f3f3f3fc7a83f3fcdc43f3f3f3f3fb0a33fa1da42
UTF-8 玲곷뙎溜김몳杻쇤겣濡쇘퓖幼볥젿吳쏅뼹娃듬【B 11101111101001101010110111101010101100111011011111101011100110011000111011101111101001111000101111101010101110011000000011101011101010101011001111101111101001111000100011101100100001111010010011101010101100101010001111100110101111111010000111101100100001111001100011101101100100111001011011100101101110011011110011101011101100111010010111101100101000001011111111100101100100001011001111101100100011111000010111101011101111001011100111100101101010001000001111101011100100111010110011100011100000001001000001000010 efa6adeab3b7eb998eefa78beab980ebaab3efa788ec87a4eab2a3e6bfa1ec8798ed9396e5b9bcebb3a5eca0bfe590b3ec8f85ebbcb9e5a883eb93ace3809042
UHC 玲곷뙎溜김몳杻쇤겣濡쇘퓖幼볥젿吳쏅뼹娃듬【B 11100111101111111000000111101011100011001001001111101010111111101011000111101000100100011001101111101010111101001011110011101001100000011011010111101011101000011011110011100111101111111000000111101010111010101001001111101011101000001011000111100111111011111001101111101011100101101011110011101000110111111011010111101011101000011011110001000010 e7bf81eb8c93eafeb1e8919beaf4bce981b5eba1bce7bf81eaea93eba0b1e7ef9beb96bce8dfb5eba1bc42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)