To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 奪遜則奪造賊奪遜則奪造賊B 10010010010001001001000110111011100100011010010110010010010001001001000110100010100100011010111110010010010001001001000110111011100100011010010110010010010001001001000110100010100100011010111101000010 924491bb91a5924491a291af924491bb91a5924491a291af42
EUC-JP 奪遜則奪造賊奪遜則奪造賊B 11000011101001011100001010111101110000101010011111000011101001011100001010100100110000101011000111000011101001011100001010111101110000101010011111000011101001011100001010100100110000101011000101000010 c3a5c2bdc2a7c3a5c2a4c2b1c3a5c2bdc2a7c3a5c2a4c2b142
UTF-8 奪遜則奪造賊奪遜則奪造賊B 11100101101001011010101011101001100000011001110011100101100010011000011111100101101001011010101011101001100000001010000011101000101100111000101011100101101001011010101011101001100000011001110011100101100010011000011111100101101001011010101011101001100000001010000011101000101100111000101001000010 e5a5aae9819ce58987e5a5aae980a0e8b38ae5a5aae9819ce58987e5a5aae980a0e8b38a42
UHC 奪遜則奪造賊奪遜則奪造賊B 11110111101011001110000111100001111101101100111011110111101011001111000011100011111011101110010011110111101011001110000111100001111101101100111011110111101011001111000011100011111011101110010001000010 f7ace1e1f6cef7acf0e3eee4f7ace1e1f6cef7acf0e3eee442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)