To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 肢私??中??磐??肢私??中??磐??^ 1000111010001000100011101000010000111111001111111001001010000110001111110011111110010100110101100011111100111111100011101000100010001110100001000011111100111111100100101000011000111111001111111001010011010110001111110011111101011110 8e888e843f3f92863f3f94d63f3f8e888e843f3f92863f3f94d63f3f5e
EUC-JP 肢私??中??磐??肢私??中??磐??^ 1011101111101000101110111110010000111111001111111100001111100110001111110011111111001000110110000011111100111111101110111110100010111011111001000011111100111111110000111110011000111111001111111100100011011000001111110011111101011110 bbe8bbe43f3fc3e63f3fc8d83f3fbbe8bbe43f3fc3e63f3fc8d83f3f5e
UTF-8 肢私렎렠中찔렱磐렱쌨肢私렎렠中찔렱磐렱쌤^ 11101000100000101010001011100111101001111000000111101011101000001000111011101011101000001010000011100100101110001010110111101100101100001001010011101011101000001011000111100111101000111001000011101011101000001011000111101100100011001010100011101000100000101010001011100111101001111000000111101011101000001000111011101011101000001010000011100100101110001010110111101100101100001001010011101011101000001011000111100111101000111001000011101011101000001011000111101100100011001010010001011110 e882a2e7a781eba08eeba0a0e4b8adecb094eba0b1e7a390eba0b1ec8ca8e882a2e7a781eba08eeba0a0e4b8adecb094eba0b1e7a390eba0b1ec8ca45e
UHC 肢私렎렠中찔렱磐렱쌨肢私렎렠中찔렱磐렱쌤^ 1111001010110110110111101110011110001110101001001000111010110001111100011110100111000010111100011000111010111110110110101111000110001110101111101011110111011110111100101011011011011110111001111000111010100100100011101011000111110001111010011100001011110001100011101011111011011010111100011000111010111110101111011101110001011110 f2b6dee78ea48eb1f1e9c2f18ebedaf18ebebddef2b6dee78ea48eb1f1e9c2f18ebedaf18ebebddc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)