To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 厓る?瓦?????侑?????畑??岳??^ 11111010100011011000001011101001001111111000101010100010001111110011111100111111001111110011111110011000110100000011111100111111001111110011111100111111100101001010100000111111001111111000101001111000001111110011111101011110 fa8d82e93f8aa23f3f3f3f3f98d03f3f3f3f3f94a83f3f8a783f3f5e
EUC-JP 厓る?瓦??煐??侑?????畑??岳??^ 10001111101101001100011110100100111010110011111110110100101001000011111100111111100011111100100111111000001111110011111111010000110100100011111100111111001111110011111100111111110010001010101000111111001111111011001111011001001111110011111101011110 8fb4c7a4eb3fb4a43f3f8fc9f83f3fd0d23f3f3f3f3fc8aa3f3fb3d93f3f5e
UTF-8 厓る젌瓦귨쪕煐뷰퓥侑멨룵溜띹퓗畑뤹성岳롫꽪^ 11100101100011101001001111100011100000101000101111101100101000001000110011100111100100111010011011101010101101111010100011101100101010101001010111100111100001011001000011101011101101111011000011101101100100111010010111100100101111101001000111101011101010011010100011101011101000111011010111101111101001111000101111101011100111011011100111101101100100111001011111100111100101011001000111101011101001001011100111101100100001001011000111100101101100101011001111101011101000011010101111101010101111011010101001011110 e58e93e3828beca08ce793a6eab7a8ecaa95e78590ebb7b0ed93a5e4be91eba9a8eba3b5efa78beb9db9ed9397e79591eba4b9ec84b1e5b2b3eba1abeabdaa5e
UHC 厓る젌瓦귨쪕煐뷰퓥侑멨룵溜띹퓗畑뤹성岳롫꽪^ 11100100111011011010101011101011101000001000110111101000101111111000001011101111101001011000111111100111101111001011101011100100101111111000111011101010111000101011100011100101100011111010101011101010111111101000110111101000101111111000001011101111101001011000111111100111101111001011101011100100101111111000111011101011100001001011010101011110 e4edaaeba08de8bf82efa58fe7bcbae4bf8eeae2b8e58faaeafe8de8bf82efa58fe7bcbae4bf8eeb84b55e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)