To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????Ø???^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111101100000111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fd83f3f3f5e
SJIS-WIN 哀??吟??癒る?怏??蹂??癲??萸??^ 100010001010001100111111001111111000101111100001001111110011111110010110111111001000001011101001001111111001110010001001001111110011111111100110111110000011111100111111111000011001111100111111001111111110010011001110001111110011111101011110 88a33f3f8be13f3f96fc82e93f9c893f3fe6f83f3fe19f3f3fe4ce3f3f5e
EUC-JP 哀??吟??癒る?怏??蹂??癲?Ø萸??^ 1011000010100101001111110011111110110110111000110011111100111111110011001111111010100100111010110011111111010111111010010011111100111111111011001111101000111111001111111110001010100001001111111000111110101001101011001110100011010000001111110011111101011110 b0a53f3fb6e33f3fccfea4eb3fd7e93f3fecfa3f3fe2a13f8fa9ace8d03f3f5e
UTF-8 哀얜퀬吟귨쫱癒る춴怏뉐맽蹂좎긾癲됰Ø萸먨뀑^ 111001011001001110000000111011001001011010011100111011011000000010101100111001011001000010011111111010101011011110101000111011001010101110110001111001111001100110010010111000111000001010001011111011001011011010110100111001101000000010001111111010111000100110010000111010111010011110111101111010001011100110000010111011001010001010001110111010101011100010111110111001111001100110110010111010111001000010110000110000111001100011101000100100001011100011101011101010001010100011101011100000001001000101011110 e59380ec969ced80ace5909feab7a8ecabb1e79992e3828becb6b4e6808feb8990eba7bde8b982eca28eeab8bee799b2eb90b0c398e890b8eba8a8eb80915e
UHC 哀얜퀬吟귨쫱癒る춴怏뉐맽蹂좎긾癲됰Ø萸먨뀑^ 11100100111011101011111011101011101100111010000011101011111000011000001011101111101001101000100111101011101010001010101011101011101011011001000011100100111010001000011111100101100100001011111011101011101100111010000011101100100000111000001011101111101001101000100111101011101010001010101011101011101011011001000011100101100001011000101101011110 e4eebeebb3a0ebe182efa689eba8aaebad90e4e887e590beebb3a0ec8382efa689eba8aaebad90e5858b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)