To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ??ゴ畏??遙??[??ゴ畏??遙??[^ 001111110011111110000011010100111000100011011000001111110011111111101010101000010011111100111111010110110011111100111111100000110101001110001000110110000011111100111111111010101010000100111111001111110101101101011110 3f3f835388d83f3feaa13f3f5b3f3f835388d83f3feaa13f3f5b5e
EUC-JP ??ゴ畏??遙??[??ゴ畏??遙??[^ 001111110011111110100101101101001011000011011010001111110011111111110100101000110011111100111111010110110011111100111111101001011011010010110000110110100011111100111111111101001010001100111111001111110101101101011110 3f3fa5b4b0da3f3ff4a33f3f5b3f3fa5b4b0da3f3ff4a33f3f5b5e
UTF-8 料썽ゴ畏붼쇋遙닻닎[料썽ゴ畏붼쇋遙닻닎[^ 111011111010011010111110111011001000110110111101111000111000001010110100111001111001010110001111111010111011011010111100111011001000011110001011111010011000000110011001111010111000101110111011111010111000101110001110010110111110111110100110101111101110110010001101101111011110001110000010101101001110011110010101100011111110101110110110101111001110110010000111100010111110100110000001100110011110101110001011101110111110101110001011100011100101101101011110 efa6beec8dbde382b4e7958febb6bcec878be98199eb8bbbeb8b8e5befa6beec8dbde382b4e7958febb6bcec878be98199eb8bbbeb8b8e5b5e
UHC 料썽ゴ畏붼쇋遙닻닎[料썽ゴ畏붼쇋遙닻닎[^ 111010001111011110111101111010011010101110110100111010001110011010010100111010011001100110111101111010011010101110110100111010011000100010010100010110111110100011110111101111011110100110101011101101001110100011100110100101001110100110011001101111011110100110101011101101001110100110001000100101000101101101011110 e8f7bde9abb4e8e694e999bde9abb4e988945be8f7bde9abb4e8e694e999bde9abb4e988945b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)