Which bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 燿??節??歪??汝 1110000010100000001111110011111110010000110111110011111100111111100110000110001100111111001111111001001111110000 e0a03f3f90df3f3f98633f3f93f0
EUC-JP 燿??節??歪??汝 1110000010100010001111110011111111000000111000010011111100111111110011111100010000111111001111111100011011110010 e0a23f3fc0e13f3fcfc43f3fc6f2
UTF-8 燿ⓨ넇節ㅿ슨歪딀퍍汝 111001111000011110111111111000101001001110101000111010111000010010000111111001111010111110000000111000111000010110111111111011001000101010101000111001101010110110101010111010111001010010000000111011011000110110001101111001101011000110011101 e787bfe293a8eb8487e7af80e385bfec8aa8e6adaaeb9480ed8d8de6b19d
UHC 燿ⓨ넇節ㅿ슨歪딀퍍汝 1110100011111100101010001110010110000110100101111110111110111101101001001110111110111101101111001110100011100000100010101110011010111011100001001110011010100011 e8fca8e58697efbda4efbdbce8e08ae6bb84e6a3

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hexadecimal 3f).
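
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping of the table's labels to Python codec names (SJIS-WIN to "cp932", UHC to "cp949") and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Unencodable characters are replaced with "?", which matches the 3f bytes in the table, but other rows may differ from the page's output depending on how the page interprets its input.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("汝").items():
        print(label, binary, hexadecimal)
```

For the single character 汝, the UTF-8 row comes out as e6b19d, and the ISO-8859-1 row as 3f because that charset has no Japanese characters.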