What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??異??兢誼??儒??繞???θ? 1110001010100011001111110011111110001000110110010011111100111111100110010101110110001011011000100011111100111111100011101111001000111111001111111110001110000101001111110011111100111111100000111100011000111111 e2a33f3f88d93f3f995d8b623f3f8ef23f3fe3853f3f3f83c63f
EUC-JP 筌??異??兢誼??儒??繞???θ? 1110010010100101001111110011111110110000110110110011111100111111110100011011111010110101110000110011111100111111101111001111010000111111001111111110010111100101001111110011111100111111101001101100100000111111 e4a53f3fb0db3f3fd1beb5c33f3fbcf43f3fe5e53f3f3fa6c83f
UTF-8 筌뚯떓異긴풚兢誼띰쭓儒멥뀊繞볥쓬劉θ퉪 1110011110101101100011001110101110011010101011111110101110010110100100111110011110010101101100001110101010111000101101001110110110010010100110101110010110000101101000101110100010101010101111001110101110011101101100001110110010101101100100111110010110000100100100101110101110101001101001011110101110000000100010101110011110111001100111101110101110110011101001011110110010010011101011001110111110100111100001111100111010111000111011011000100110101010 e7ad8ceb9aafeb9693e795b0eab8b4ed929ae585a2e8aabceb9db0ecad93e58492eba9a5eb808ae7b99eebb3a5ec93acefa787ceb8ed89aa
UHC 筌뚯떓異긴풚兢誼띰쭓儒멥뀊繞볥쓬劉θ퉪 1110111110100111100011001110110010001011101010011110110010110110101100011110010010111110100111011101000011100111111010111111111010110110111011111010011110001011111010101110001110111000111000111000010110000110111010011010010010010011111010111001110110001100111010101110010110100101111010001011100110000010 efa78cec8ba9ecb6b1e4be9dd0e7ebfeb6efa78beae3b8e38586e9a493eb9d8ceae5a5e8b982

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
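
The conversion shown in the table can be approximated offline with a short Python sketch. It is not this page's implementation. The codec names are assumptions: the labels SJIS-WIN and UHC are mapped to Python's cp932 and cp949 codecs, which may differ in detail from the converter used here. The helper name encode_report is hypothetical. Characters that a charset cannot represent are replaced with "?" (byte 3f), which matches the runs of 3f seen in the table.

```python
# Charset labels from the table mapped to Python codec names.
# Assumption: "SJIS-WIN" ~ cp932 and "UHC" ~ cp949; the page's converter may differ.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return {label: (binary string, hex string)} for each charset (hypothetical helper)."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_report("あ").items():
        print(label, bits, hex_string)
```

Each byte is printed as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.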