To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 彫?宰?醫?池?雄?彫?宰?醫?池?雄?^ 10010010101001000011111110001101110010010011111111100111110011100011111110010010011100100011111110010111010110010011111110010010101001000011111110001101110010010011111111100111110011100011111110010010011100100011111110010111010110010011111101011110 92a43f8dc93fe7ce3f92723f97593f92a43f8dc93fe7ce3f92723f97593f5e
EUC-JP 彫?宰?醫?池?雄?彫?宰?醫?池?雄?^ 11000100101001100011111110111010110010110011111111101110110100000011111111000011110100110011111111001101101110100011111111000100101001100011111110111010110010110011111111101110110100000011111111000011110100110011111111001101101110100011111101011110 c4a63fbacb3feed03fc3d33fcdba3fc4a63fbacb3feed03fc3d33fcdba3f5e
UTF-8 彫렣宰렞醫렖池렡雄뒷彫렣宰렞醫렖池렡雄뒬^ 11100101101111011010101111101011101000001010001111100101101011101011000011101011101000001001111011101001100001101010101111101011101000001001011011100110101100011010000011101011101000001010000111101001100110111000010011101011100100101011011111100101101111011010101111101011101000001010001111100101101011101011000011101011101000001001111011101001100001101010101111101011101000001001011011100110101100011010000011101011101000001010000111101001100110111000010011101011100100101010110001011110 e5bdabeba0a3e5aeb0eba09ee986abeba096e6b1a0eba0a1e99b84eb92b7e5bdabeba0a3e5aeb0eba09ee986abeba096e6b1a0eba0a1e99b84eb92ac5e
UHC 彫렣宰렞醫렖池렡雄뒷彫렣宰렞醫렖池렡雄뒬^ 1111000011000001100011101011010011101110101001011000111010101111111011001010001010001110101010111111001010101110100011101011001011101010101010011011010111011110111100001100000110001110101101001110111010100101100011101010111111101100101000101000111010101011111100101010111010001110101100101110101010101001101101011101110001011110 f0c18eb4eea58eafeca28eabf2ae8eb2eaa9b5def0c18eb4eea58eafeca28eabf2ae8eb2eaa9b5dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)