Which bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?x?xf?x?x^}Y?x?xf?x?x^}bE 00111111011110000011111101111000011001100011111101111000001111110111100001011110011111010101100100111111011110000011111101111000011001100011111101111000001111110111100001011110011111010110001001000101 3f783f78663f783f785e7d593f783f78663f783f785e7d6245
SJIS-WIN 達x達xf達x達x^}Y達x達xf達x達x^}bE 100100100100001001111000100100100100001001111000011001101001001001000010011110001001001001000010011110000101111001111101010110011001001001000010011110001001001001000010011110000110011010010010010000100111100010010010010000100111100001011110011111010110001001000101 924278924278669242789242785e7d59924278924278669242789242785e7d6245
EUC-JP 達x達xf達x達x^}Y達x達xf達x達x^}bE 110000111010001101111000110000111010001101111000011001101100001110100011011110001100001110100011011110000101111001111101010110011100001110100011011110001100001110100011011110000110011011000011101000110111100011000011101000110111100001011110011111010110001001000101 c3a378c3a37866c3a378c3a3785e7d59c3a378c3a37866c3a378c3a3785e7d6245
UTF-8 達x達xf達x達x^}Y達x達xf達x達x^}bE 1110100110000001100101000111100011101001100000011001010001111000011001101110100110000001100101000111100011101001100000011001010001111000010111100111110101011001111010011000000110010100011110001110100110000001100101000111100001100110111010011000000110010100011110001110100110000001100101000111100001011110011111010110001001000101 e9819478e981947866e9819478e98194785e7d59e9819478e981947866e9819478e98194785e7d6245
UHC 達x達xf達x達x^}Y達x達xf達x達x^}bE 110100111011100101111000110100111011100101111000011001101101001110111001011110001101001110111001011110000101111001111101010110011101001110111001011110001101001110111001011110000110011011010011101110010111100011010011101110010111100001011110011111010110001001000101 d3b978d3b97866d3b978d3b9785e7d59d3b978d3b97866d3b978d3b9785e7d6245

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
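
The conversion shown in the table can be reproduced roughly as follows. This is a minimal sketch in Python, not the code behind this page. The mapping from the table's charset labels to Python codec names (CODECS) and the function name encode_table are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?", which is what the ISO-8859-1 row above shows.

```python
# Table labels mapped to Python codec names.
# This mapping is an assumption; the page's own converter may differ in edge cases.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, concatenated without separators.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("達x"):
        print(label, bits, hex_string)
```

For the input "達x", the UTF-8 row comes out as e9819478 and the SJIS-WIN row as 924278, matching the leading bytes of the corresponding rows in the table.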